SETS FOR MATHEMATICS


Advanced undergraduate or beginning graduate students need a unified foundation
for their study of mathematics. For the first time in a text, this book uses categorical
algebra to build such a foundation, starting from intuitive descriptions of mathematically
and physically common phenomena and advancing to a precise specification
of the nature of categories of sets.

Set theory as the algebra of mappings is introduced and developed as a unifying
basis for advanced mathematical subjects such as algebra, geometry, analysis, and
combinatorics. The formal study evolves from general axioms that express universal
properties of sums, products, mapping sets, and natural number recursion. The
distinctive features of Cantorian abstract sets, as contrasted with the variable and
cohesive sets of geometry and analysis, are made explicit and taken as special axioms.
Functor categories are introduced to model the variable sets used in geometry
and to illustrate the failure of the axiom of choice. An appendix provides an explicit
introduction to necessary concepts from logic, and an extensive glossary provides
a window to the mathematical landscape.

Foreword


Why Sets for Mathematics?

This book is for students who are beginning the study of advanced mathematical
subjects such as algebra, geometry, analysis, or combinatorics. A useful foundation 
for these subjects will be achieved by openly bringing out and studying what they 
have in common. 

A significant part of what is common to all these subjects was made explicit
100 years ago by Richard Dedekind and Georg Cantor, and another significant part
50 years ago by Samuel Eilenberg and Saunders Mac Lane. The resulting idea of
categories of sets is the main content of this book. It is worth the effort to study this 
idea because it provides a unified guide to approaching constructions and problems 
in the science of space and quantity. 

More specifically, it has become standard practice to represent an object of math- 
ematical interest (for example a surface in three-dimensional space) as a “structure.” 
This representation is possible by means of the following two steps: 

(1) First we deplete the object of nearly all content. We could think of an idealized 
computer memory bank that has been erased, leaving only the pure locations 
(that could be filled with any new data that are relevant). The bag of pure points 
resulting from this process was called by Cantor a Kardinalzahl, but we will 
usually refer to it as an abstract set. 

(2) Then, just as computers can be wired up in specific ways, suitable specific map- 
pings between these structureless sets will constitute a structure that reflects 
the complicated content of a mathematical object. For example, the midpoint 
operation in Euclidean geometry is represented as a mapping whose “value” at 
any pair of points is a special third point. 

To explain the basis for these steps there is an important procedure known as the
axiomatic method: That is, from the ongoing investigation of the ideas of sets and
mappings, one can extract a few statements called axioms; experience has shown
that these axioms are sufficient for deriving most other true statements by pure
logic when that is useful. The use of this axiomatic method makes naive set theory 
rigorous and helps students to master the ideas without superstition. An analogous 
procedure was applied by Eilenberg and Steenrod to the ongoing development of 
algebraic topology in their 1952 book [ES52] on the foundations of that subject as 
well as by other practitioners of mathematics at appropriate stages in its history. 

Some of the foundational questions touched on here are treated in more detail in 
the glossary (Appendix C) under headings such as Foundations, Set Theory, Topos, 
or Algebraic Topology. 


Organization 

In Chapters 1-5 the emphasis is on the category of abstract sets and on some very 
simple categorical generalities. The additional century of experience since Cantor 
has shown the importance of emphasizing some issues such as: 

(1) Each map needs both an explicit domain and an explicit codomain (not just a 
domain, as in previous formulations of set theory, and not just a codomain, as 
in type theory). 

(2) Subsets are not mere sets with a special property but are explicit inclusion
maps. (This helps one to realize that many constructions involving subsets are 
simplified and usefully generalized when applied appropriately to maps that 
are not necessarily subsets.) 

(3) The algebra of composition satisfies the familiar associative and identity rules; 
other basic concepts, such as “belonging to” (e.g., membership in, and inclu- 
sion among, subsets) and the dual “determined by” are easily expressible as 
“division” relative to it. It turns out that this adherence to algebra (showing that 
“foundation” does not need a language distinct from that of ordinary mathe- 
matics) has broad applicability; it is particularly appropriate in smoothing the 
transition between constant and variable sets. 

(4) Because functionals play such a key role in mathematics, the algebra is explicitly 
strengthened to include the algebra of evaluation maps and induced maps. 

All of these issues are elementary and quite relevant to the learning of basic 
mathematics; we hope that mathematics teachers, striving to improve mathematics 
education, will take them to heart and consider carefully the particular positive role
that explicit formulations of mathematical concepts can play. 

Beginning in Chapter 6, examples of categories of cohesive and variable sets are 
gradually introduced; some of these help to objectify features of the constant sets 
such as recursion and coequalizers.


We illustrate the use of the maximal principle of Max Zorn in Appendix B, and
we include a proof of it in the form of exercises with hints. Several other results that
do not hold in most categories of variable or cohesive sets, such as the Schroeder-Bernstein
theorem and the total ordering of sizes, are treated in the same way.
Despite our axiomatic approach, we do not use the internal language that some 
books on topos theory elaborate; it seemed excessively abstract and complicated 
for our needs here. 

Appendix A presents essentially a short course in “all that a student needs to 
know about logic.” Appendix B, as mentioned, briefly treats a few of the special 
topics that a more advanced course would consider in detail. Appendix C provides 
a glossary of definitions for reference. Some of the glossary entries go beyond bare 
definition, attempting to provide a window into the background. 

Some exercises are an essential part of the development and are placed in the
text, whereas others that are optional but recommended are for further clarification.

F. William Lawvere Robert Rosebrugh 

June 2002


Abstract Sets and Mappings


1.1 Sets, Mappings, and Composition 

Let us discuss the idea of abstract constant sets and the mappings between them 
in order to have a picture of this, our central example, before formalizing a math- 
ematical definition. An abstract set is supposed to have elements, each of which 
has no structure, and is itself supposed to have no internal structure, except that the 
elements can be distinguished as equal or unequal, and to have no external structure 
except for the number of elements. In the category of abstract sets, there occur sets 
of all possible sizes, including finite and infinite sizes (to be defined later). It has 
been said that an abstract set is like a mental “bag of dots,” except of course that 
the bag has no shape; thus, 



may be a convenient way of picturing a certain abstract set for some considerations, 
but what is apparently the same abstract set may be pictured as 



for other considerations.


What gives the category of sets its power is the concept of mapping. A mapping
f from an abstract set A to an abstract set B is often explained through the use of
the word value. (However, since the elements of B have no structure, it would be
misleading to always think of these values as quantities.) Each mapping f from A
to B satisfies

for each element x of A
there is exactly one element y of B
such that y is a value of f at x

This justifies the phrase “the value”; the value of f at x is usually denoted by
f(x); it is an element of B. Thus, a mapping is single-valued and everywhere defined
(everywhere on its domain) as in analysis, but it also has a definite codomain (usually
bigger than its set of actual values). Any f at all that satisfies this one property is
considered to be a mapping from A to B in the category of abstract constant sets; that
is why these mappings are referred to as “arbitrary”. An important and suggestive
notation is the following:

Notation 1.1: The arrow notation $A \xrightarrow{f} B$ just means the domain of f is A and
the codomain of f is B, and we write dom(f) = A and cod(f) = B. (We will
usually use capital letters for sets and lowercase letters for mappings.) For printing
convenience, in simple cases this is also written with a colon f : A → B. We
can regard the notation f : A → B as expressing the statement dom(f) = A &
cod(f) = B, where & is the logical symbol for and.

For small A and B, a mapping from A to B can be pictured using its cograph or
internal diagram by



where f(x) is the dot at the right end of the line that has x at its left end for each
of the three possible elements x.

Abstract sets and mappings are a category, which means above all that there is
a composition of mappings, i.e., given any pair f : A → B and g : B → C there
is a specified way of combining them to give a resulting mapping g ∘ f : A → C.
Note that the codomain set of the first mapping f must be exactly the same set
as the domain set of the second mapping g. It is common to use the notation ∘
for composition and to read it as “following,” but we will also, and much more
often, denote the composite “g following f” just by gf. A particular instance of
composition can be pictured by an external diagram or by an internal diagram as
below. First consider any three mappings f, g, and m with domains and codomains
as indicated:



The internal cograph diagrams express the full information about particular maps, 
which is often more than we need; thus, we will use simple, external diagrams 
wherever possible. 

Since any mapping satisfies restrictions of the kind “for each ... there is exactly
one ...”, in the diagram above we observe that

• for each element a of A there is exactly one element b of B for which b is a value
of f at a (briefly f(a) = b);

• for each element b of B there is exactly one element c of C for which c is a value
of g at b (briefly g(b) = c);

• for each element a of A there is exactly one element c of C for which c is a value
of m at a (briefly m(a) = c).

The external diagram above is said to be a “commutative diagram” if and only if
m is actually the composite of g following f; then, notationally, we write simply
m = gf.

More precisely, for the triangular diagram to be considered commutative, the
relation between f, g, m must have the following property:

For each element a of A we can find the value of m(a) by proceeding in two
steps: first find f(a) and then find g(f(a)); the latter is the same as m(a).

(Examining the internal diagram shows that m = gf in the figure above.)

A familiar example, when A = B = C is a set of numbers equipped with structural
mappings providing addition and multiplication, involves f(x) = x² and
g(x) = x + 2 so that (g ∘ f)(x) = x² + 2. The value of the composite mapping
at x is the result of taking the value of g at the value of f at x. In contexts such as
this where both multiplication and composition are present, it is necessary to use
distinct notations for them.
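
The numerical example above can be tried out directly. The following is only a minimal sketch in Python, in which the helper name compose is our own choice and mappings are modeled as ordinary functions on numbers (so domains and codomains are left implicit, unlike in the text):

```python
def compose(g, f):
    """Return the composite 'g following f', i.e. the mapping x -> g(f(x))."""
    return lambda x: g(f(x))

def f(x):
    return x * x      # f(x) = x^2, built from multiplication

def g(x):
    return x + 2      # g(x) = x + 2, built from addition

gf = compose(g, f)    # (g o f)(x) = x^2 + 2
fg = compose(f, g)    # (f o g)(x) = (x + 2)^2, in general a different mapping

print(gf(3))  # 11
print(fg(3))  # 25
```

Note that the product x * x and the composite compose(g, f) are written differently, which is the distinction in notation that the paragraph above asks for.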

Exercise 1.2

Express the mapping that associates to a number x the value √(x² + 2) as a composite
of three mappings. ◊

We need to be more precise about the concept of category. The ideas of set, mapping,
and composition will guide our definition, but we need one more ingredient.
For each set A there is the identity mapping 1_A : A → A whose values are determined
by 1_A(x) = x. For any set A, this definition determines a particular mapping
among the (possibly many) mappings whose domain and codomain are both A.

On the basis of the preceding considerations we have part of the information
required to define the general notion of “category”. The first two items listed correspond
to abstract sets and arbitrary mappings in the example of the category of sets.
A category C has the following data:

• Objects: denoted A, B, C, ...

• Arrows: denoted f, g, h, ... (arrows are also often called morphisms or maps)

• To each arrow f is assigned an object called its domain and an object called its
codomain (if f has domain A and codomain B, this is denoted f : A → B)

• Composition: To each f : A → B and g : B → C there is assigned an arrow
gf : A → C called “the composite of f and g” (or “g following f”)

• Identities: To each object A is assigned an arrow 1_A : A → A called “the identity
on A”.


1.2 Listings, Properties, and Elements 

We have not finished defining category because the preceding data must be constrained
by some general requirements. We first continue with the discussion of
elements. Indeed, we can immediately simplify things a little: an idea of element
is not necessary as a separate idea because we may always identify the elements
themselves as special mappings. That will be an extreme case of the parameterizing
of elements of sets. Let us start with a more intermediate case, for example, the
set of mathematicians, together with the indication of two examples, say Sir Isaac
Newton and Gottfried Wilhelm Leibniz. Mathematically, the model will consist
not only of an abstract set A (to stand for the set of all mathematicians) but also
of another abstract set of two elements 1 and 2 to act as labels and the specified
mapping with codomain A whose value at 1 is “Newton” and whose value at 2 is
“Leibniz”. The two-element set is the domain of the parameterization.

Such a specific parameterization of elements is one of two kinds of features of
a set ignored or held in abeyance when we form the abstract set. Essentially, all of
the terms - parameterization, listing, family - have abstractly the same meaning:
simply looking at one mapping into a set A of interest, rather than just at the one
set A all by itself.

Whenever we need to insist upon the abstractness of the sets, such a preferred
listing is one of the two kinds of features we are abstracting away.

The other of the two aspects of the elements of an actual concrete aggregation
(which are to be ignored upon abstraction) involves the properties that the elements
might have. For example, consider the set of all the mathematicians and the property
“was born during the seventeenth century” that some of the mathematicians have and
some do not. One might think that this is an important property of mathematicians
as such, but nonetheless one might momentarily just be interested in how many
mathematicians there are.

Certain properties are interpreted as particular mappings by using the two-element
set of “truth values” - true, false - from which we also arrive (by the
abstraction) at the abstract set of two elements within which “true” could be taken
as exemplary. If we consider a particular mapping such as



we see that all those elements of A that go to “true” will constitute one portion of A,
and so f determines a property “true” for some elements, and “not true,” or “false,”
for others. There are properties for which the codomain of f will need more than
two elements, for example, age of people: the codomain will need at least as many
elements as there are different ages.

As far as listing or parameterizing is concerned, an extreme case is to imagine that 
all the elements have been listed by the given procedure. The opposite extreme case 
is one in which no examples of elements are being offered even though the actual 
set A under discussion has some arbitrary size. That is, in this extreme case the 
index set is an empty set. Of course, the whole listing or parameterization in this 
extreme case amounts really to nothing more than the one abstract set A itself. 

Just short of the extreme of not listing any is listing just one element. We can do 
this using a one-element set as parameter set. 

[Figure: a one-element parameter set (domain) listing one element of A (codomain)]

To characterize mathematically what the one-element set is, we will consider it
in terms of the property that does not distinguish. The following is the first axiom
we require of the category of sets and mappings.

AXIOM: TERMINAL SET 

There is a set 1 such that for any set A there is exactly one mapping A → 1. This
unique mapping is given the same name A as the set that is its domain.

We call 1 a terminal object of the category of sets and mappings. There may 
or may not be more than one terminal object; it will make no difference to the 
mathematical content. In a given discussion the symbol 1 will denote a chosen 
terminal object; as we will see, which terminal object is chosen will also have no 
effect on the mathematical content. 

Several axioms will be stated as we proceed. The axiom just stated is part of the
stronger requirement that the category of sets and mappings has finite inverse limits
(see Section 3.6). A typical cograph picture is


[Figure: cograph of the unique mapping A → 1]



Only a one-element set V = 1 can have the extreme feature that one cannot detect
any distinctions between the elements of A by using only “properties” A → V.
Having understood what a one-element set is in terms of mapping to it, we can now
use mappings from it to get more information about arbitrary A.

Definition 1.3: An element of a set A is any mapping whose codomain is A and
whose domain is 1 (or, abbreviated, a : 1 → A).

(Why does 1 itself have exactly one element according to this definition?) 

The first consequence of our definition is that 

element is a special case of mapping. 

A second expression of the role of 1 is that 

evaluation is a special case of composition. 

In other words, if we consider any mapping f from A to B and then consider any
element a of A, the codomain of a and the domain of f are the same; thus, we can
form the composite fa,

[Figure: external diagram of a : 1 → A, f : A → B, and the composite fa]

which will be a mapping 1 → B. But since the domain is 1, this means that fa
is an element of B. Which element is it? It can only be, and clearly is, the value of
f at a:

[Figure: internal diagram showing the element a of A and its value f(a) in B]

That is, if a is an element, fa = f(a).
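
The two slogans above can be illustrated on small finite sets. The following is a hedged sketch, not part of the formal development: finite mappings are stored as Python dicts, the one-element set 1 is represented by the single label "*", and the names compose and element as well as the sample sets are our own assumptions.

```python
def compose(g, f):
    """Composite 'g following f' of finite mappings stored as dicts."""
    return {x: g[f[x]] for x in f}

def element(value):
    """An element of a set, regarded as a mapping from the one-element set 1 = {'*'}."""
    return {"*": value}

f = {"a1": "b1", "a2": "b1", "a3": "b2"}   # a mapping f : A -> B
g = {"b1": "c1", "b2": "c2"}               # a mapping g : B -> C

a = element("a3")        # an element a : 1 -> A
fa = compose(f, a)       # the composite fa : 1 -> B, again an element
print(fa)                # {'*': 'b2'}

# Evaluation is composition: fa is the element of B picked out by f(a).
assert fa == element(f["a3"])

# Evaluation of a composite: (gf)a and g(fa) are the same element of C.
gf = compose(g, f)
assert compose(gf, a) == compose(g, fa)
```

A dict records only the values of a mapping, so the codomain is implicit in this sketch; the text insists that a mapping carries its codomain explicitly.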

Finally, a third important expression of the role of 1 is that

evaluation of a composite is a special case of the
associative law

of composition (which will be one of the clauses in the definition of category). In
order to see this, suppose m = gf and consider



The formula (in which we introduce the symbols $\forall$ to mean “for all” and $\Rightarrow$ to
mean “implies”)

$m = gf \Rightarrow [\forall a\,[1 \xrightarrow{a} A \Rightarrow m(a) = g(fa)]]$

expresses our idea of evaluation of the composition of two mappings; i.e., if m is
the composite of f and g, then for any element a of the domain of f the value of
m at a is equal to the value of g at f(a). More briefly, (gf)a = g(fa), which is a
case of the associative law.

The three points emphasized here mean that our internal pictures can be (when 
necessary or useful) completely interpreted in terms of external pictures by also 
using the set 1 . 

Notice that the axiom of the terminal set and the definition of element imply
immediately that the set 1 whose existence is guaranteed by the axiom has exactly
one element, namely, the unique mapping from 1 to 1. There is always an identity
mapping from a set to itself, so this unique mapping from 1 to 1 must be the identity
mapping on 1.

We want to introduce two more logical symbols: the symbol $\exists$ is read “there exists,”
and $\exists!$ is read “there exists exactly one”. Thus, we can repeat the characteristic
feature of every f : A → B as follows:

$\forall a : 1 \to A \;\; \exists!\, b : 1 \to B \;\; [\,b \text{ is a value of } f \text{ at } a\,]$

But this is a special case of the fact that composition in general is uniquely defined.

1.3 Surjective and Injective Mappings 

Recall the first internal diagram (cograph) of a mapping that we considered: 



Note that it is not the case for the f in our picture that

for each element b of B
there is an element x of A
for which b is the value of f at x (f(x) = b)

Definition 1.4: A mapping f : A → B that has the existence property “for each
element b of B there is an element x of A for which b = f(x)” is called a surjective
mapping.

Neither is it the case that the f in our picture has the property

for each element b of B
there is at most one element x of A
for which f(x) = b

Definition 1.5: A mapping f : A → B that has the uniqueness property “given
any element b of B there is at most one element x of A for which f(x) = b” is
called an injective mapping. In other words, if f is an injective mapping, then for
all elements x, x' of A, if f(x) = f(x'), then x = x'.


Definition 1.6: A mapping that is both surjective and injective is called bijective.

Thus, the f pictured above is neither surjective nor injective, but in the figure
below g : A → B is an injective mapping from the same A and to the same B.
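
For finite sets, Definitions 1.4 to 1.6 can be checked mechanically. The sketch below is only an illustration under our own assumptions: mappings are dicts, the codomain is passed separately because surjectivity depends on it, and the sets A and B here are small hypothetical examples rather than the pictured ones.

```python
def is_surjective(f, codomain):
    """Every element of the codomain is the value of f at some element."""
    return set(f.values()) == set(codomain)

def is_injective(f):
    """No element of the codomain is the value of f at two different elements."""
    values = list(f.values())
    return len(values) == len(set(values))

def is_bijective(f, codomain):
    return is_injective(f) and is_surjective(f, codomain)

A = [1, 2, 3]
B = ["p", "q", "r", "s"]

f = {1: "p", 2: "p", 3: "q"}   # neither injective nor surjective onto B
g = {1: "p", 2: "q", 3: "r"}   # injective, but "s" is not a value

print(is_injective(f), is_surjective(f, B))   # False False
print(is_injective(g), is_surjective(g, B))   # True False
print(is_bijective(g, ["p", "q", "r"]))       # True
```

The last line shows why the codomain must be specified: the same table of values is bijective onto a three-element codomain but not surjective onto B.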



Exercise 1.7

Is the pictured g surjective? ◊

Exercise 1.8

Are there any surjective mappings A → B for the pictured A, B? ◊

Exercise 1.9

How many mappings from the three-element set A to the seven-element set B are
there? Can we picture them all? ◊

Exercise 1.10

Same as 1.9, but for mappings B → A from a seven-element to a three-element
set. ◊

Exercise 1.11

Are there any surjective B → A? Are there any injective ones? ◊

Exercise 1.12

What definition of “f₁ ≠ f₂” is presupposed by the idea “number of” mappings
we used in 1.9 and 1.10? ◊

Exercises 1.9 and 1.12 illustrate that the feature “external number/internal inequality
of instances” characteristic of an abstract set is also associated with the
notion “mapping from A to B,” except that the elements (the mappings) are not
free of structure. But abstractness of the sets really means that the elements are for
the moment considered without internal structure. By considering the mappings
from A to B with their internal structure ignored, we obtain a new abstract set B^A.
Conversely, we will see in Chapter 5 how any abstract set F of the right size can
act as mappings between given abstract sets. (For example, in computers variable
programs are just a particular kind of variable data.)
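
To get a feeling for the set of all mappings from A to B, one can enumerate it for very small sets. The following is a sketch under our own assumptions (mappings as dicts, the helper name all_mappings, and a deliberately tiny two-element A and three-element B so as not to preempt the exercises):

```python
from itertools import product

def all_mappings(A, B):
    """List every mapping A -> B, each one stored as a dict."""
    A = list(A)
    return [dict(zip(A, values)) for values in product(B, repeat=len(A))]

A = ["x", "y"]
B = [0, 1, 2]

maps = all_mappings(A, B)
print(len(maps))                  # 9
print(len(all_mappings(B, A)))    # 8

# Two mappings count as different when they differ at some element of A.
distinct = {tuple(sorted(m.items())) for m in maps}
print(len(distinct))              # 9
```

Forgetting the internal structure of each dict and keeping only the fact that there are nine distinct ones gives the abstract set written B^A in the text.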
1.4 Associativity and Categories

Recall that we saw in Section 1.2 that an “associative law” in a special case expresses
the evaluation of composition. Indeed, whenever we have

$1 \xrightarrow{a} A \xrightarrow{f} B \xrightarrow{g} C$

then we have the equation (gf)(a) = g(fa).

If we replace a by a general mapping u : T → A whose domain is not necessarily
1, we obtain the associative law

(gf)u = g(fu)

which actually turns out to be true for any three mappings that can be composed;
i.e., from the commutativity of the two triangles below we can conclude that
moreover the outer two composite paths from T to C have equal composites (it is
said that the whole diagram is therefore “commutative”).

[Figure: commutative diagram with u : T → A, f : A → B, g : B → C, in which (gf)u = g(fu)]



Since the 1 among abstract sets has the special feature (which we discuss in 
Section 1.5) that it can separate mappings, in abstract sets the general associative 
law follows from the special case in which T = 1. 

An important property of identity mappings is that they not only “do nothing” to
an element but that they have this same property with respect to composition. Thus,
if 1_A : A → A and 1_B : B → B are identity mappings, then for any f : A → B
we have the equations

f 1_A = f = 1_B f

With these ideas in hand we are ready to give the completed definition of category.
The beginning of our specification repeats what we had before:

Definition 1.13: A category C has the following data:

• Objects: denoted A, B, C, ...

• Arrows: denoted f, g, h, ... (arrows are also often called morphisms or maps)

• To each arrow f is assigned an object called its domain and an object called its
codomain (if f has domain A and codomain B, this is denoted f : A → B or
$A \xrightarrow{f} B$)

• Composition: To each f : A → B and g : B → C, there is assigned an arrow
gf : A → C called “the composite g following f”

• Identities: To each object A is assigned an arrow 1_A : A → A called
“the identity on A”.

The data above satisfy the axioms

• Associativity: if $A \xrightarrow{f} B \xrightarrow{g} C \xrightarrow{h} D$, then h(gf) = (hg)f

• Identity: if f : A → B, then f = f 1_A and f = 1_B f.
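
The data and axioms of Definition 1.13 can be mirrored for finite sets in a few lines. What follows is a minimal sketch under stated assumptions, not a definitive implementation: the class name Arrow, the helpers arrow, compose, and identity, and the sample sets are all our own, and finite mappings stand in for the arrows of the category of abstract sets.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Arrow:
    """A mapping between finite sets, with explicit domain and codomain."""
    dom: frozenset
    cod: frozenset
    table: tuple  # pairs (x, value of the mapping at x), sorted by x

def arrow(dom, cod, values):
    """Build an Arrow from a dict, checking it is defined exactly on dom."""
    dom, cod = frozenset(dom), frozenset(cod)
    if set(values) != dom or not set(values.values()) <= cod:
        raise ValueError("not a mapping from dom to cod")
    return Arrow(dom, cod, tuple(sorted(values.items())))

def compose(g, f):
    """Return gf ('g following f'); defined only when cod(f) equals dom(g)."""
    if f.cod != g.dom:
        raise ValueError("codomain of f must equal domain of g")
    g_at = dict(g.table)
    return Arrow(f.dom, g.cod, tuple((x, g_at[y]) for x, y in f.table))

def identity(A):
    """The identity arrow 1_A."""
    return arrow(A, A, {x: x for x in A})

A, B, C, D = {1, 2}, {"p", "q", "r"}, {10, 20}, {"u", "v"}
f = arrow(A, B, {1: "p", 2: "r"})
g = arrow(B, C, {"p": 10, "q": 20, "r": 10})
h = arrow(C, D, {10: "u", 20: "v"})

assert compose(h, compose(g, f)) == compose(compose(h, g), f)   # associativity
assert compose(f, identity(A)) == f == compose(identity(B), f)  # identity laws
```

Because each Arrow carries its domain and codomain, compose refuses a pair whose codomain and domain do not match exactly, which is the requirement emphasized in Section 1.1.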

As we have been emphasizing, 

AXIOM: S IS A CATEGORY

Abstract sets and mappings form a category (whose objects are called sets, and
whose arrows are called mappings).

This is the basic axiom implicit in our references to the “category of abstract sets
and mappings” above. There are many other examples of categories to be found in
mathematics, and a few of these are described in the exercises in Section 1.8 at the
end of the chapter.


1.5 Separators and the Empty Set 


If a pair of mappings

$A \underset{f_2}{\overset{f_1}{\rightrightarrows}} B$

has the same domain and has the same codomain (i.e., they are two mappings that
could be equal), then we can discover whether they are really equal by testing with
elements

$(\forall x\,[1 \xrightarrow{x} A \Rightarrow f_1 x = f_2 x]) \Rightarrow f_1 = f_2$

i.e., if the value of f₁ equals the value of f₂ at every element x of A, then f₁ = f₂.
This is one of the ways in which we can conclude that f₁ = f₂. The converse
implication of the statement is trivial because it is merely substitution of equals for
equals (a general idea of mathematics). But the indicated implication is a special,
particularly powerful feature of one-element abstract sets. In its contrapositive form
it states: If f₁ ≠ f₂, then there exists at least one element x at which the values of
f₁ and f₂ are different. (This is the answer to Exercise 1.12!) For a category C an
object with this property is called a separator.
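
For finite sets the contrapositive form can be made concrete: two parallel mappings are different exactly when some element tells them apart. The sketch below is only illustrative; the helper name separating_elements and the sample mappings are our own, and mappings are again stored as dicts with a shared domain.

```python
def separating_elements(f1, f2):
    """Elements x of the common domain at which f1 and f2 take different values."""
    assert f1.keys() == f2.keys(), "the two mappings must share a domain"
    return [x for x in f1 if f1[x] != f2[x]]

f1 = {"a1": 0, "a2": 1, "a3": 1}
f2 = {"a1": 0, "a2": 1, "a3": 0}

print(separating_elements(f1, f2))  # ['a3']
print(separating_elements(f1, f1))  # []
```

An empty result means the two mappings agree at every element and are therefore equal; a nonempty result exhibits an element at which they differ.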


Definition 1.14: An object S in a category C is a separator if and only if whenever

$A \underset{f_2}{\overset{f_1}{\rightrightarrows}} B$

are arrows of C then

$(\forall x\,[S \xrightarrow{x} A \Rightarrow f_1 x = f_2 x]) \Rightarrow f_1 = f_2$

As mentioned in Section 1.4 the property we have been describing is required of the
terminal object 1 as a further axiom in the category of abstract sets and arbitrary
mappings. It is a powerful axiom with many uses; it is special to the category S
of abstract sets and will not hold in categories of variable and cohesive sets where
more general elements than just the “points” considered here may be required for
the validity of statements even analogous to the following one (see Section 1.6):

AXIOM: THE TERMINAL OBJECT 1 SEPARATES MAPPINGS IN S
A one-element set 1 is a separator in S, i.e., if

$X \underset{f_2}{\overset{f_1}{\rightrightarrows}} Y$

then

$(\forall x\,[1 \xrightarrow{x} X \Rightarrow f_1 x = f_2 x]) \Rightarrow f_1 = f_2$

Exercise 1.15

In the category of abstract sets S, any set A with at least one element 1 → A is
also a separator. (When an exercise is a statement, prove the statement.) ◊

We return to the extreme case of listing or parameterization in which no elements 
are listed. In this case there cannot be more than one listing map (we will use “map” 
and “mapping” synonymously!) into A since the indexing set we are trying to use 
is empty. On the other hand, there must be one since the statement defining the 
property of a mapping is a requirement on each element of the domain set (that 
there is assigned to it a unique value element in the codomain). This property 
is satisfied “vacuously” by a mapping from a set without elements since there is 
simply no requirement. Thus, there exists a unique mapping from an empty set to 
any given set. We require such a set as an axiom. 

AXIOM: INITIAL SET 

There is a set 0 such that for any set A there is exactly one mapping 0 → A.

We call 0 an initial object of the category of sets and mappings. 

Note that the form of this axiom is the same as the form of the axiom of the
terminal set, i.e., we require the existence of a set and a unique mapping for every
set, except that the unique mapping is now to the arbitrary set whereas formerly
it was from the arbitrary set. Like the axiom of the terminal set, the axiom of the
initial set will become part of a stronger axiom later. The initial set is often called
the empty set because, as we will later see, there are no maps 1 → 0.

Exercise 1.16

In the category of abstract sets S the initial set 0 is not a separator. (Assume that
two sets A and B exist with at least two maps A → B.) ◊
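
The initial and terminal set axioms can be observed on small finite sets by counting mappings. This is a hedged sketch: the helper all_mappings, the label "*" for the single element of 1, and the three-element set A are our own assumptions, and a finite check of this kind illustrates the axioms without proving them.

```python
from itertools import product

def all_mappings(A, B):
    """List every mapping A -> B, each one stored as a dict."""
    A = list(A)
    return [dict(zip(A, values)) for values in product(B, repeat=len(A))]

EMPTY = []          # plays the role of the initial set 0
ONE = ["*"]         # plays the role of the terminal set 1
A = ["a", "b", "c"]

print(len(all_mappings(EMPTY, A)))    # 1: the vacuous mapping 0 -> A
print(len(all_mappings(A, ONE)))      # 1: the unique mapping A -> 1
print(len(all_mappings(ONE, A)))      # 3: the elements of A
print(len(all_mappings(ONE, EMPTY)))  # 0: there are no maps 1 -> 0
```

The unique mapping out of the empty list is the empty dict: the requirement on each element of the domain is satisfied vacuously, as described above.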


ADDITIONAL EXAMPLES: 

(1) If T is an index set of numbers, then

$T \xrightarrow{x} X$

could be the listing of all the American presidents in chronological order. It
does turn out that the map is not injective - Cleveland was both the 22nd and
the 24th president.

If we want to ask who was the 16th president, the structure of the question
involves all three: the actual set, the actual listing, and a choice of index:

[Figure: i : 1 → T followed by x : T → X, with composite x_i = xi : 1 → X]

Lincoln derives by composing the index i = 16 and the list x of presidents.

(2) There are at least two uses of two-element sets:

Index sets and truth-value sets

Consider

[Figure: a mapping from the two-element index set, 1 = “best”, 2 = “second best”, to the set of all tennis players]

The one that used to be the second-best tennis player could become the best;
encode that by noting that there is an endomapping (or self-mapping) τ that
interchanges the two denominations. The list f′ that is correct today can be the
reverse of the list f that was correct yesterday if an “upset” match occurred; i.e.,
we could have f′ = fτ.

A similar sort of thing happens also on the side of the possible properties of
the elements of X:

[Figure: a property f : X → V, where V is a two-element set of truth values]

In this case we could also compose with the τ, but now it would instead be τ
following f (which is written τf). This is called logical negation since it transforms
f into not-f, i.e., (not-f)(x) = not-f(x). The composite property is the property of
not having the property f. Often in the same discussion both reparameterization of
lists and logical or arithmetic operations on properties occur, as suggested in the
following diagram:

[Figure: diagram relating T, X, and V]

If we have a list x of elements and a property f, then the composite fx can be
thought of in two equally good ways. Because V represents values, we can think of
this fx as just a property of elements of T; for example, given the listing x of the
presidents, the property f of their being Democrats becomes a property of indices.
But fx could also be considered as a list (of truth values). The two concepts thus
reduce to the same in the special case T → V, giving

    LIST            TRUTH VALUES         PROPERTY            INDICES
     or      of          or         =       or        of        or
    FAMILY          QUANTITIES           MEASUREMENT         PARAMETERS


Of course, the words for $T$ (indices/parameters) and the words for $V$ (truth
values/quantities) only refer to structure, which is “forgotten” when $T$, $V$ are abstract
sets (but which we will soon “put back in” in a more conscious way); we mention
this fact mainly to emphasize its usefulness (via specific $x$ and $f$) even when the
structure forgotten on $X$ itself was of neither of those kinds.
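
The two readings of a composite, and the two ways the swap of a two-element set can act, can be tried out on small finite mappings stored as Python dictionaries. This is only an illustrative sketch: the player names, the sample property, and the helper `compose` are hypothetical and do not come from the text.

```python
def compose(g, f):
    """Return the composite g f (g following f) of finite mappings given as dicts."""
    return {t: g[f[t]] for t in f}

# Hypothetical data: a list x : T -> X indexed by T = {1, 2}
# and a two-valued property f : X -> V with V = {False, True}.
x = {1: "Ada", 2: "Bea"}                        # 1 = "best", 2 = "second best"
f = {"Ada": True, "Bea": False, "Cy": False}    # some property of players

swap_T = {1: 2, 2: 1}                 # the swap acting on indices
swap_V = {True: False, False: True}   # the swap acting on truth values (negation)

fx = compose(f, x)              # a property of indices, equally a list of truth values
x_upset = compose(x, swap_T)    # the reversed list after an "upset"
not_f = compose(swap_V, f)      # the property of not having property f
```

Composing the swap before a list reorders the list, whereas composing it after a property negates the property; the same two-element set plays both roles.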


Exercise 1.17 

Consider 

$S$ = set of socks in a drawer in a dark room
$V$ = {white, black}
$f$ = color

How big must my “sampler” $T$ be in order that for all injective $x$, $fx$ is not
injective (i.e., at least two chosen socks will be “verified” to have the same color)?

[Diagram: the sampling $x : T \to S$ followed by $f = \text{color} : S \to V$, with the labels
“objective” and “subjective”]

◊


1.6 Generalized Elements 

Consider the following three related statements (from Sections 1.2 and 1.4): 

(1) Element is a special case of mapping; 

(2) Evaluation is a special case of composition; 

(3) Evaluation of a composite is a special case of the associative law of composition. 


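Statement (2) in one picture is

[Diagram: commutative triangle with $a : 1 \to A$, $f : A \to B$, and $b : 1 \to B$]

(that is to say, $fa = b$), in which $a$, $b$ are elements, considered as a special case of
the commutativity of the following, in which $a$, $b$ are general mappings:

[Diagram: the same triangle with 1 replaced by $T$, so that $a : T \to A$ and $b : T \to B$]

“Taking the value” is the special case of composition in which $T$ is taken to be 1.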

For statement (3), recall that the associative law applies to a situation in which 
we have in general three mappings: 


[Diagram: $T \xrightarrow{a} A \xrightarrow{f} B \xrightarrow{g} C$, with $g(fa) = (gf)a$]

We can compute the triple composite in two ways: We can either form $fa$ and
follow that by $g$, getting $g(fa)$, or we can first form $gf$ ($g$ following $f$) and consider
$a$ followed by that, obtaining what we call $(gf)a$; the associative law of composition
says that these are always equal for any three mappings $a$, $f$, $g$.

In a special case, where $T = 1$, the description would be that the value of the
composite $gf$ at an element $a$ is by definition the element of $C$ that we get by
successive evaluation of $f$ and then of $g$, leading to the same picture

$g(fa) = (gf)a$

but one that is special because it emanates from the one-element set.
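
For finite sets the associative law and its special case $T = 1$ can be checked directly. The following is a minimal sketch, assuming mappings are represented as Python dictionaries; the sets, the element names, and the helper `compose` are hypothetical.

```python
def compose(g, f):
    """Return the composite g f (g following f) of finite mappings given as dicts."""
    return {t: g[f[t]] for t in f}

# Hypothetical mappings a : T -> A, f : A -> B, g : B -> C.
a = {"t0": "a1", "t1": "a0", "t2": "a1"}
f = {"a0": "b0", "a1": "b1"}
g = {"b0": "c0", "b1": "c0"}

left = compose(g, compose(f, a))     # g(fa)
right = compose(compose(g, f), a)    # (gf)a

# Special case T = 1: an element of A is a mapping from a one-point set {"*"}.
element = {"*": "a1"}
value = compose(compose(g, f), element)["*"]   # evaluation of gf at the element
```

Here `left` and `right` agree as whole mappings, and evaluating the composite at an element is just the same computation carried out with a one-point domain.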

In many cases we will actually want to reverse this specialization procedure; that 
is, by the phrase 

an element of A 

we often actually mean a mapping to $A$ from any $T$ (not only $T = 1$). In case of
confusion we may refer to this as

a generalized element of $A$

A generalized element of $A$ is actually nothing but an arbitrary mapping whose
codomain happens to be $A$. It has some domain, but we do not insist that that
domain be 1.

A more accurate description in terms of the content would be

variable element of $A$ varying over $T$
This is a very ancient way of using the idea of element; for example, if we consider 
the way people talk about temperature, they say the temperature, which seems to 
be an element of the set of temperatures, and intend by it the actual temperature. 
On the other hand, yesterday it was one value, and today it is another value. It is 
varying, but it is still “the”. Just think of

$T$ as the set of days;

for every day there is an element in the constant sense. The actual temperature on that
day is the value of a mapping; the mapping itself is the temperature the weatherman 
is talking about. It is varying, but it is still considered as one entity, one “element”. 
We speak this way about many other situations. “I am a different person today than 
I was yesterday, yet I am still the same person.” To explain anything that involves 
that sort of variation, one very common kind of model for it will involve somehow 
(not as its only ingredient by any means but as one ingredient) 

• an abstract set that plays the role of the “temporal” parameters, 

• another one that plays the role of values, and 

• a specific mapping that describes the result of the evolution process.

The process itself has some cause that may also have to be mentioned, but the result
of that cause will be a succession of values; change is often described that way.
But we will always correctly persist in attaching the definite article “the” as in “the
temperature,” “the me” to express the underlying unity. An element of the set of
temperatures might very well be

(a) one single unchanging value of temperatures;

equally well it might be

(b) the temperature of Milan during one week,

in which case it is still one (generalized) element of the set of temperatures, but its
domain is $T$.
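
The contrast between (a) and (b) can be sketched with dictionaries standing in for mappings. All numbers and day names below are made up for illustration, and `compose` is a hypothetical helper; the point is only that both objects are mappings into the same set of temperatures and differ in their domain.

```python
def compose(g, f):
    """Return the composite g f (g following f) of finite mappings given as dicts."""
    return {t: g[f[t]] for t in f}

# (a) an element in the narrow sense: a mapping 1 -> Temperatures
constant_temp = {"*": 21.0}

# (b) a generalized element: a mapping T -> Temperatures, with T = days of one week
week_temp = {"Mon": 18.5, "Tue": 19.0, "Wed": 21.0, "Thu": 20.5,
             "Fri": 22.0, "Sat": 23.5, "Sun": 19.5}

# A particular day is an element 1 -> T; composing with it picks out that day's value.
wednesday = {"*": "Wed"}
temp_on_wednesday = compose(week_temp, wednesday)
```

Evaluating the variable element at a day yields an element in the narrow sense, which in this made-up data happens to coincide with `constant_temp`.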

Variable sets also occur in real life; for example, the set of people inhabiting a 
particular house may vary from year to year. To mathematically model such situ- 
ations, categories of variable sets are useful. In this chapter we are emphasizing 
the category in which the sets themselves are constant, but later we will explicitly 
describe and construct examples of categories in which the sets are (even continu- 
ously) variable. 


1.7 Mappings as Properties 

What about the mappings that have domain $A$? They certainly are not elements of
$A$ in the usual sense; they could be called properties of (elements of) $A$.

To deal first with some trivial cases, let us review the definition of 1 and 0. The
characterizing feature of an empty set 0 is that

for each set $A$ there is just one mapping $0 \to A$,

whereas the characterizing feature of a terminal set 1 is that

for each set $A$ there is just one mapping $A \to 1$.

The descriptions of 0 and 1 look rather similar except that the arrows are reversed,
but we will see that these objects are quite different when we start talking about
mappings from 1 and into 0 such as

$1 \to B$ versus $A \to 0$

Whereas mappings $1 \to B$ exist for any $B$ that is nonempty - and there will be
many of them (depending on the size of $B$) - by contrast $A \to 0$ will exist only if
$A$ is also empty, and even then there is only one map.

So the sets whose existence is required by the axioms of the terminal set and the
initial set should be very different! To allow $0 \cong 1$ would result in all sets having
only one (narrow sense) element. We must clearly avoid that. For sets $A$ and $B$,
the notation $A \cong B$, which is read “$A$ is isomorphic to $B$,” means that there are
mappings $f : A \to B$ and $g : B \to A$ satisfying $gf = 1_A$ and $fg = 1_B$. We will
have much more to say about this concept later (see Section 3.2), but for now we
use it (in a negative way) in the following:

AXIOM: NONDEGENERACY OF $\mathcal{S}$

$0 \not\cong 1$


Exercise 1.18 

How many mappings are there from 0 to 1 ? from 0 to 0? from 1 to 0? (and so how 
many elements does 0 have?) 

Hint: To answer the third question requires more axioms than do the other two. ◊

Notice that we could have taken the apparently stronger statement “there is 
no mapping from 1 to 0” instead of the axiom as stated. The stronger statement 
certainly implies the axiom, but as shown by the exercise the axiom implies the 
stronger statement too. 

We are considering the question: If $V$ is a fixed set of values, then for any $A$,
what kind of properties $A \to V$ can there be? (“Property” is being used in such a
general sense that it means just an arbitrary mapping but from the dual point of
view to that of generalized elements.)

If V is 0, we see that the answer to the preceding question is “none,” unless A is 
itself empty, and in that case only one. If V = 1 , there is exactly one mapping for any 
A. We have to take a V that is somewhat larger if we want any interesting property 
of (elements of) A at all. Thus, we will take a two-element set for V and see that 
we in fact get enough properties to “discriminate” between the elements of any A. 

The ability of “discriminating” elements by means of $V$-valued properties is
frequently expressed by the following:

Definition 1.19: An object $V$ is a coseparator if for any $A$ and for any parallel
pair

$a_0, a_1 : T \rightrightarrows A$

(of “generalized elements”)

$[\forall \varphi : A \to V,\ \varphi a_0 = \varphi a_1] \implies a_0 = a_1$

Notice that if $V$ is a coseparator, then $a_0 \neq a_1$ entails that there is $\varphi : A \to V$ with
$\varphi a_0 \neq \varphi a_1$, i.e. $V$ can discriminate elements. By what we have said here, neither
$V = 0$ nor $V = 1$ can coseparate.

Exercise 1.20 

Use the fact that 1 is a separator in the category of abstract sets to show that (in
that category), if $V$ can discriminate elements, then it can discriminate generalized
elements. Thus, in $\mathcal{S}$ the general $T$ in Definition 1.19 could safely be replaced by
$T = 1$. ◊

Claim 1.21: If $V$ is a set with exactly two elements, call it 2, then $V$ is a coseparator
in the category of abstract sets.

Remark 1.22: We are assuming that there exists a set 2 with exactly two elements,
$0, 1 : 1 \rightrightarrows 2$. We will make this existence more precise in Chapter 2. We also denote
by $\tau : 2 \to 2$ the mapping completely defined by $\tau(0) = 1$ and $\tau(1) = 0$.

Here is a provisional justification of Claim 1.21 based on our naive conception
of $V = 2$; soon we will have enough axioms to formally prove it. Assume that $A$
is arbitrary but that there are maps $a_0, a_1 : 1 \rightrightarrows A$ with $a_0 \neq a_1$. We must “construct” a
$\varphi : A \to 2$ with $\varphi a_0 \neq \varphi a_1$, showing that 2 coseparates (which follows by using the
result of Exercise 1.20). That this is possible is quite obvious from a picture, as
follows. If $A$ is an arbitrary abstract set mapped into the two-element set by $\varphi$

[Diagram: $\varphi : A \to 2$]

then $A$ is divided in 2 parts. Conversely, any mutually exclusive and jointly
exhaustive division of $A$ into two parts arises from a mapping to the two-element set.
(These are called either indicator functions or characteristic functions in probability
theory, combinatorics, and other subjects.)

The $\varphi$ is the indicator or characteristic function of the whole partition in two parts
and would be determined by such a pattern. In our problem there is no specific
partition. We merely want to show that a partition can be chosen with the two
elements in different parts. This is a very weak condition on $\varphi$, so there are many
such mappings; in the extreme we could imagine that only $a_0$ goes to one value, or in
the opposite extreme that only $a_1$ goes to the other value. Both are extreme concepts
of $\varphi$, and both would satisfy the requirement, namely, that the composites are still
different. We can make that more specific: We insist that one of the composites
comes out to be the element 0 and the other one the element 1, where we have
structured the abstract set 2 by considering both of its elements as being listed:

[Diagram: the elements $a_0, a_1 : 1 \rightrightarrows A$ followed by $\varphi : A \to 2$]

$\varphi(a_0) = 0 \neq 1 = \varphi(a_1)$


We can simply insist that the indexing is preserved by $\varphi$. Among these maps there
is a sort of lowest possible choice for $\varphi$:

$$\varphi(a) = \begin{cases} 1 & \text{if } a = a_1 \\ 0 & \text{if } a \neq a_1 \end{cases}$$

which is one possibility, and a highest possible choice for $\varphi$:

$$\varphi(a) = \begin{cases} 1 & \text{if } a \neq a_0 \\ 0 & \text{if } a = a_0 \end{cases}$$

at the other extreme.

Both of these extreme cases of $\varphi$ separate $a_0$, $a_1$. There will be many partitions
$\varphi$ in between these two that will also separate $a_0$, $a_1$, but since there exists at least
one, we have convinced ourselves that 2 is a coseparator.
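
For a finite set the lowest and highest separating properties are easy to write down explicitly. The sketch below assumes a small hypothetical set with made-up element names; `lowest_phi` and `highest_phi` are illustrative helper names, not notation from the text.

```python
def lowest_phi(A, a1):
    """The property that takes the value 1 only at a1."""
    return {a: 1 if a == a1 else 0 for a in A}

def highest_phi(A, a0):
    """The property that takes the value 0 only at a0."""
    return {a: 0 if a == a0 else 1 for a in A}

# Hypothetical four-element set with two distinct chosen elements.
A = {"p", "q", "r", "s"}
a0, a1 = "p", "q"

low = lowest_phi(A, a1)
high = highest_phi(A, a0)
```

Both mappings send the first chosen element to 0 and the second to 1, and every other separating property that preserves the indexing lies between them pointwise.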

How many properties are there exactly? 


Exercise 1.23 

Consider a two-element set and a three-element set:

[Diagram: a two-element set $A$ and a three-element set $B$]

Name the elements of $A$ by $a_0$ and $a_1$ and the elements of $B$ by $b_0$, $b_1$, $b_2$. Draw the
cographs of all mappings $f : A \to B$ and all $g : B \to A$. How many mappings
are there from $A \to B$? (We should find that there are nine.) How many from
$B \to A$? (The answer is eight $= 2^3$.) Notice that if each cograph were reduced
to a dot we would obtain abstract sets of mappings with nine and eight elements;
these abstract sets can in turn be used to parameterize the actual mappings (see
Chapters 5 and 7). ◊

Exercise 1.24 

What about mappings from a five-element set into a two-element set? How many
mappings are there from

$X \to 2$, where $X$ has five elements?

Answer: $2^5 = 32$ mappings ◊

In general, we can say that the number of (two-valued) properties on a set with $n$
elements is $2^n$. One of the ideas of Cantor was to give meaning to this statement
even when $n$ is not finite, as we shall see in more detail later.
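
The counts quoted in the last two exercises can be confirmed by brute-force enumeration for small finite sets. This is a sketch only; the element names and the helper `all_mappings` are assumptions made for the illustration.

```python
from itertools import product

def all_mappings(domain, codomain):
    """List every mapping domain -> codomain as a dict (finite sets only)."""
    domain = list(domain)
    return [dict(zip(domain, values))
            for values in product(codomain, repeat=len(domain))]

A = ["a0", "a1"]          # a two-element set
B = ["b0", "b1", "b2"]    # a three-element set
X = range(5)              # a five-element set
two = [0, 1]              # a two-element set of values

count_A_to_B = len(all_mappings(A, B))
count_B_to_A = len(all_mappings(B, A))
count_X_to_2 = len(all_mappings(X, two))
```

Each mapping is one choice of a value for every element of the domain, which is why the count is the size of the codomain raised to the size of the domain.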

The coseparating property of 2 is phrased in such a way that it is really just a dual
of the separating property that 1 has, i.e. there are enough elements to discriminate
properties just as there are enough properties to discriminate elements.

Consider the various elements in, and the various 2-valued properties on, the set $A$:

$1 \xrightarrow{\ a\text{'s}\ } A \xrightarrow{\ \varphi\text{'s}\ } 2$

We test whether $a_0$, $a_1$ are different by using such properties as follows:

$a_0 \neq a_1 \implies \exists \varphi$ such that $\varphi a_0 \neq \varphi a_1$

For each pair we find a $\varphi$; for another pair we might need another $\varphi$. The dual
statement

$\varphi_1 \neq \varphi_2 \implies \exists a$ at which $\varphi_1 a \neq \varphi_2 a$

that there are enough elements, distinguishing between properties with any kind
$V$ of value, specializes to a statement about two-valued properties: $\varphi_1 \neq \varphi_2$ means
that two mappings are different, but for mappings to be different there must be
some element $a$ at which their values are different. (Again we warn the reader that
in categories of less abstract sets, these separating and coseparating roles will be
taken by families of objects richer than 2 and 1.)

Recall that the map $A : A \to 1$ is unique. The statement “$f$ is constant” means
that there exists a single element of $B$ such that $f$ is the composite: the single
element following the unique map, i.e.

[Diagram: $f$ factoring as $A \to 1 \xrightarrow{b} B$]

Definition 1.25: An arrow $f : A \to B$ is constant if and only if it factors through 1;
i.e. there is an element $b : 1 \to B$ of $B$ such that $f = bA : A \to 1 \to B$.

In particular, if we form the composites of the unique map to 1 with the two
elements of 2, we get two constant maps $2 \to 2$:

[Diagram: the two constant maps $2 \to 1 \xrightarrow{0} 2$ and $2 \to 1 \xrightarrow{1} 2$]

We will find various circumstances in which it is useful to look at the system of
self-maps of 2 in its own right, not just as part of the category of abstract sets but
as a scheme for picking out another, more interesting kind of set called reversible
graph (see Section 10.3). For this and other reasons it is important to know how
these four maps compose. We can write down the “multiplication table” for this
four-map system. The self-maps of any given object can always be composed, and
having names for them (four in this case), we can show, in the multiplication table,
what is the name of the composite of any two of them taken in a given order. Here
is the resulting table, in which the entry in row $g$ and column $h$ is the composite
$gh$ ($g$ following $h$):

$$
\begin{array}{c|cccc}
       & 1_2  & 0 & 1 & \tau \\ \hline
1_2    & 1_2  & 0 & 1 & \tau \\
0      & 0    & 0 & 0 & 0    \\
1      & 1    & 1 & 1 & 1    \\
\tau   & \tau & 1 & 0 & 1_2
\end{array}
$$


The composing of constants (which comes up in computing this table) works in
general as follows:

If $f$ is constant with constant value $b$, then for any $g$, $gf$ is constant but with
constant value $gb$:

$A \xrightarrow{f} B \xrightarrow{g} C$

Similarly, if $f$ is constant, then for any mapping $a$, $fa$ is also constant with the
same constant value as $f$:

$T \xrightarrow{a} A \xrightarrow{f} B$
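
The multiplication table of the four endomappings of 2 can also be generated mechanically. The following is a minimal sketch; the string labels `"id"`, `"0"`, `"1"`, `"tau"` and the helpers `compose` and `name_of` are names chosen for this illustration only.

```python
def compose(g, f):
    """Return the composite g f (g following f) of finite mappings given as dicts."""
    return {t: g[f[t]] for t in f}

# The four endomappings of the two-element set {0, 1}.
endos = {
    "id":  {0: 0, 1: 1},   # identity
    "0":   {0: 0, 1: 0},   # constant with value 0
    "1":   {0: 1, 1: 1},   # constant with value 1
    "tau": {0: 1, 1: 0},   # the swap
}

def name_of(m):
    """Look up the label of an endomapping of {0, 1}."""
    return next(label for label, e in endos.items() if e == m)

# table[g][h] is the label of the composite g h (g following h).
table = {g: {h: name_of(compose(endos[g], endos[h])) for h in endos}
         for g in endos}
```

The rows for the two constants are constant, in line with the rule that a constant following anything keeps its value, while the column of a constant under the swap shows the value being transformed.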



Exercise 1.26 

Verify the composition table for endomappings of 2 by considering all four
possibilities for each of the two arrows in $2 \to 2 \to 2$ and computing the composite. ◊

Exercise 1.27

Show how this four-element system acts as “logical operators” on the properties
$A \to 2$ of (elements of) any $A$ but also acts in another way on the lists $2 \to A$
(or ordered pairs) of elements of $A$.

Hint: Recall the discussion of tennis players in Section 1.5. ◊

1.8 Additional Exercises 


Exercise 1.28 

Uniqueness properties: 

(a) Show that identity arrows in a category are unique, i.e. if $1_A : A \to A$ and
$1'_A : A \to A$ both satisfy the equations for identity arrows, then $1_A = 1'_A$.

(b) Show that the set 1 is unique “up to unique isomorphism,” i.e. if $1'$ also satisfies
the axiom of the terminal set, then there are unique mappings $f : 1 \to 1'$ and
$g : 1' \to 1$, and $gf = 1_1$ and $fg = 1_{1'}$.

(c) Similarly, show that the empty set 0 is unique up to unique isomorphism.

Exercise 1.29

(a) Show that if $f : A \to B$ and $g : B \to C$ are injective mappings, then so is $gf$,
i.e. “composites of injectives are injective”.

(b) Show that composites of surjectives are surjective.

Exercise 1.30

Categories of structures: 

(a) Show that (finite-dimensional) vector spaces as objects and linear transforma- 
tions as arrows form a category with composition of arrows defined by compo- 
sition of mappings. Show that a one-dimensional space S is a separator; is it a 
coseparator? Is the terminal object a separator? 

(b) Show that groups as objects and group homomorphisms as arrows form a 
category with composition of arrows defined by composition of mappings. 
Show that the additive group of integers is a separator. (Look up the definition 
of “group” if necessary. We will discuss groups further in Section 10.1.) 

(c) Show that partially ordered sets as objects and monotone (= order-preserving) 
mappings as arrows form a category with composition of arrows defined by 
composition of mappings. Is the terminal object a separator in this category? 
Is there a coseparator? (Look up the definition of “partially ordered set” if 
necessary. We will discuss partially ordered sets further in Section 10.1.) 

(d) A finite state machine with two inputs $\alpha$ and $\beta$ is a finite set $Q$ of “states” and
a pair of endomaps of $Q$ corresponding to $\alpha$ and $\beta$ that effect the elementary
state transitions of which the machine is capable. Given another such machine
with state set $Q'$, a homomorphism is a mapping $\varphi : Q \to Q'$ of state sets
preserving the transitions in the sense that

$\varphi\alpha = \alpha'\varphi$ and $\varphi\beta = \beta'\varphi$

where we have denoted by $\alpha'$ and $\beta'$ the corresponding state transitions of
the second machine. Define composition of homomorphisms and show that a
category results.

Exercise 1.31 

Categories as structures: 

(a) Let $V$ be a given vector space. Show that the following data define a
category $\mathbf{V}$:

$\mathbf{V}$ has just one object called *;

the arrows of $\mathbf{V}$ from * to * are the vectors $v$ in $V$;

the identity arrow for * is the zero vector;

the composite of vectors $v$ and $v'$ is their sum.

Hint: With the data given, verify the equations of Definition 1.13. 

(b) Let $X$ be a given partially ordered set (with partial order $\leq$). Show that the
following data define a category $\mathbf{X}$:

the objects of $\mathbf{X}$ are the elements of $X$;

for elements $x, x'$ in $X$ there is an arrow from $x$ to $x'$ exactly when $x \leq x'$.

(It follows that there is at most one arrow from $x$ to $x'$.)

Hint: Here we did not specify the composition or identity arrows, but there is
no choice about how to define them. Why?

Exercise 1.32 

(a) (Dual categories) There are many methods of constructing new categories from 
known categories. An important example is the dual or opposite category of a 
category. Let $\mathcal{C}$ be a category. The dual category $\mathcal{C}^{op}$ has the same objects as $\mathcal{C}$,
but for objects $A$ and $B$ the arrows from $A$ to $B$ in $\mathcal{C}^{op}$ are exactly the arrows
from $B$ to $A$ in $\mathcal{C}$. Show how to define composition and identities for $\mathcal{C}^{op}$ to
make it a category.

(b) (Slice categories) Another important construction is the “slice category”. Let $\mathcal{C}$
be a category and $X$ an object of $\mathcal{C}$. The slice category of $\mathcal{C}$ by $X$, denoted $\mathcal{C}/X$,
has as objects the arrows of $\mathcal{C}$ with codomain $X$. Let $f : A \to X$ and $g : B \to X$
be objects of $\mathcal{C}/X$. The arrows of $\mathcal{C}/X$ from $f$ to $g$ are arrows $h$ of $\mathcal{C}$ such that
$f = gh$, i.e. they are the same thing as commutative triangles from $f$ to $g$.
Composition and identities are inherited from $\mathcal{C}$. Verify that $\mathcal{C}/X$ is a category.

(c) (Pointed sets) If we define objects to be pairs consisting of a set $A$ and an
element $a : 1 \to A$ of $A$, and arrows from $A, a$ to $B, b$ to be mappings $f : A \to B$
such that $fa = b$, then we obtain a category denoted by $1/\mathcal{S}$. Verify that $1/\mathcal{S}$
is a category. Does $1/\mathcal{S}$ have an initial object? Does it have a terminal object?
Does the terminal object separate mappings in $1/\mathcal{S}$? (This is a special case of
a dual notion to slice category: For $\mathcal{C}$ a category and $X$ an object of $\mathcal{C}$, describe
the category $X/\mathcal{C}$.)



2 

Sums, Monomorphisms, and Parts 


2.1 Sum as a Universal Property 

Our basic definitions of the fundamental operations 

ADDITION MULTIPLICATION EXPONENTIATION 

are all universal mapping properties; that is, we take as the defining condition on 
such a construction essentially “what it is good for”. 

Let us consider sums. Our naive picture of the sum is that the sum of two sets
contains two parts, that the two parts are the same size as the given sets, that the
sum does not have anything in it except those two parts, and that the two parts do
not overlap. These ideas can be expressed more briefly as follows:

The two parts are exhaustive and mutually exclusive.





The sum as defined by the universal mapping property will, in particular, have two 
mutually disjoint parts equivalent to two given sets and that together exhaust the 
whole sum. However, it is not satisfactory to take these conditions as a definition 
for at least two different reasons: 

(1) There are categories other than abstract sets; for objects in these categories 
there is also a notion of sum but one in which the two parts may interact. The 
description in terms of nonoverlapping would be incorrect in such a case (see 
Exercise 2.40). It is better to use the same form of the definition so that we see 
the similarity more transparently. 

(2) The second reason is related to the gap between the naive picture and the 
formalized axiomatics. We always have some kind of naive picture for the
things we are trying to describe by mathematical formalism. But the naive
picture usually does not match up perfectly with the precise formalism. There 
is a gap between the two. We try to get them as close together as we can. That is 
one of the struggles that moves mathematics forward. In particular, the words 
“mutually exclusive and exhaustive pair of parts” describe formally much of 
the naive picture of a sum. However, if we took that as the formalized axiom, it 
would not formally imply the needed stronger conclusion that we can always 
do certain things with the sum. We take those certain uses as the axiom. 

First, let us consider the particular case of the sum of two copies of a one-element 
set. 

Any two-element set will have two arrows from 1 (elements in the narrow sense);
we could give them names 0 and 1.

[Diagram: the two elements $0, 1 : 1 \rightrightarrows 2$]

That the set 2 has two parts which are disjoint and exhaustive means that these two
arrows are different and that there are no other arrows $1 \to 2$. If we tried to take
that as our definition, we would meet a difficulty with other things we want to do
with 2: Namely, what is a mapping from 2 into another set $B$? It should just mean
to give a pair of elements of $B$ because such a single mapping has to have a value
at one element and also a value at the other element.

Obviously if we had a mapping $2 \to B$, we could compose it with the two
elements of 2, and we would obtain two elements of $B$.

Conversely, given two elements of $B$, say $b_0$ and $b_1$, then the naive picture of
2 and the “arbitrariness” of mappings suggest that there exists a unique mapping
$f : 2 \to B$ whose values are the given elements. We are led to define this particular
sum as follows:

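Definition 2.1: The statement

$2 = 1 + 1$

means that

there are given mappings $1 \xrightarrow{0} 2 \xleftarrow{1} 1$ such that

$\forall B,\ 1 \xrightarrow{b_0} B,\ 1 \xrightarrow{b_1} B \quad \exists! f\ [f(0) = b_0 \text{ and } f(1) = b_1]$

as in

[Diagram: the elements $0, 1 : 1 \to 2$, the elements $b_0, b_1 : 1 \to B$, and the unique $f : 2 \to B$]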



The uniqueness means that if we had two mappings $f$, $g$, both with the specified
values, then they would be equal. We already introduced the principle that there are
enough mappings from 1 to distinguish maps. From that the uniqueness part of the
condition on $f$ would follow if there were only the two maps from 1 to 2; that is true
for the category of abstract sets (but not for all categories of interest). Concerning
the existence of $f$, the naive idea that arbitrary mappings exist is a guide, but we
want to prove all of the existence theorems from just a few; this is one of the few that
we assume. For any set $B$, to map 2 into $B$ it suffices to specify two elements; then
there will be one and only one $f$. The exhaustiveness and disjointness of elements
will follow (with the help of the other axioms). Knowing how a given object maps
to all objects $B$ in its category determines indirectly how each object maps into it,
but some work will be required to realize that determination, even for the object 2.

Recall the contrast of listings versus properties (see Chapter 1). A sum of 1's
(such as 2) is clearly well-adapted to a simple listing of elements of an arbitrary
object $B$. In the category $\mathcal{S}$ of abstract sets (and in a few others), 2 also serves well
as a codomain for properties.

It is really no more difficult to describe the sum of any two sets; but the sum of 
two sets is not merely a third set; rather, it is two maps as follows: 

Definition 2.2: A sum of the two sets $A_0$ and $A_1$ is a set $A$ together with given
mappings $A_0 \xrightarrow{i_0} A \xleftarrow{i_1} A_1$ such that

$\forall B,\ A_0 \xrightarrow{f_0} B,\ A_1 \xrightarrow{f_1} B \quad \exists! f\ [f i_0 = f_0 \text{ and } f i_1 = f_1]$

as in the commutative diagram

[Diagram: $A_0 \xrightarrow{i_0} A \xleftarrow{i_1} A_1$ with $f_0$, $f_1$, and the unique $f : A \to B$]

We denote the unique $f$ defined by the mappings $f_0$ and $f_1$ by

$f = \begin{cases} f_0 \\ f_1 \end{cases}$

We abbreviate by writing $A = A_0 + A_1$ with the mappings $i_0$ and $i_1$ understood.

Thus, given any two sets $A_0$ and $A_1$, to make $A$ the sum of the two, we must give
$i_0$ and $i_1$ with the universal mapping property presented at the beginning of this
Section. For any $B$ at all, if we have a map $f : A \to B$, then of course, by composing,
we get two maps $A_0 \to B$ and $A_1 \to B$; but the sum injections $i_0$ and $i_1$ are
special so that conversely, given any $f_0$ defined on one part $A_0$ and any $f_1$ defined
on the other part $A_1$, there is exactly one $f$ defined on the whole that composes
with the $i_0$, $i_1$ to give the original $f_0$, $f_1$.
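
One familiar way to realize a sum of finite sets is to tag each element with the part it comes from. The sketch below is an illustration under that assumption, not the construction used in the text; the names `make_sum`, `copair`, and `compose`, and the sample sets and values, are all hypothetical.

```python
def compose(g, f):
    """Return the composite g f (g following f) of finite mappings given as dicts."""
    return {t: g[f[t]] for t in f}

def make_sum(A0, A1):
    """Return (A, i0, i1): a tagged disjoint union together with its two injections."""
    i0 = {a: (0, a) for a in A0}
    i1 = {a: (1, a) for a in A1}
    A = set(i0.values()) | set(i1.values())
    return A, i0, i1

def copair(f0, f1):
    """The mapping f on the tagged sum with f i0 = f0 and f i1 = f1."""
    f = {(0, a): b for a, b in f0.items()}
    f.update({(1, a): b for a, b in f1.items()})
    return f

A0 = {"x"}
A1 = {"x", "y"}            # overlaps A0 on purpose; the tags keep the parts disjoint
A, i0, i1 = make_sum(A0, A1)

f0 = {"x": 10}
f1 = {"x": 30, "y": 40}
f = copair(f0, f1)
```

Because every element of the tagged set comes from exactly one part, the two equations leave no freedom in defining `f`, which is the finite shadow of the existence and uniqueness clauses of the definition.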

AXIOM: BINARY SUMS 

Any two sets $A_0$, $A_1$ have a sum $A_0 \xrightarrow{i_0} A \xleftarrow{i_1} A_1$.

Exercise 2.3 

Given a set $A_0$ with exactly two elements and a set $A_1$ with exactly three elements,
specify a set $A$ and two injection maps for which you can prove the universal
mapping property of a sum of $A_0$ and $A_1$. That is, given any $f_0$ and $f_1$, show how
to determine an appropriate $f$. ◊

EXAMPLE: COGRAPHS 

We have already seen an important example of a mapping from a sum. The
cograph of a mapping, which we described with a picture in Section 1.1, is
actually an instance. Indeed, suppose that $f : A \to B$ is any mapping whatsoever.
We can define a mapping $c : A + B \to B$ by $c = \begin{cases} f \\ 1_B \end{cases}$. That is, the following is a
commutative diagram:

[Diagram: $A \xrightarrow{i_0} A + B \xleftarrow{i_1} B$ with $c i_0 = f$ and $c i_1 = 1_B$]

The cograph of a mapping is illustrated by its internal diagram (which we have
already used informally many times!) by drawing the sets $A$ and $B$ disjointly and
showing a linking of an element $a$ of $A$ to an element $b$ of $B$ whenever $f(a) = b$;
thus, two elements in $A + B$ are linked in the internal diagram if and only if they
are merged by the cograph mapping $c$.

Exercise 2.4

Illustrate the example from Exercise 2.3 by an internal cograph picture. ◊


Coming back to Definition 2.2, for any element $a : 1 \to A$, we have

$$(*) \qquad \begin{array}{ll} fa = f_0 a_0 & \text{if } i_0 a_0 = a \text{ for some element } a_0 \text{ of } A_0 \\ fa = f_1 a_1 & \text{if } i_1 a_1 = a \text{ for some element } a_1 \text{ of } A_1 \end{array}$$

Naively, there could not be two different maps $f$ that satisfy the condition (*),
for $A$ is exhausted by the two parts; if there were an element $a$ that could not be
expressed in either of these two forms, then we could imagine that there could be
maps $f$ for which the equations (*) would be true and yet $f$ could have various
values at such $a$. In other words the exhaustiveness of the two parts corresponds
naively to the uniqueness of the $f$. On the other hand, the disjointness has something
to do with the existence of the $f$. What would happen if we had an element $a$ that
could be fed through both $i_0$ and $i_1$ (i.e., the two parts overlap, at least on that point)?

[Diagram: an element $a$ of $A$ with $a = i_0 a_0 = i_1 a_1$]

Then we could imagine that $f_0$ and $f_1$ take different values at $a_0$ and $a_1$; therefore,
there could not be any $f$ at all because even $fa$ would not be well-defined. The
naive content of the disjointness and exhaustiveness corresponds to the existence
and uniqueness of such maps $f$.

We will write $A_0 + A_1 = A$ as a kind of shorthand; it means that whenever we
need to apply the universal property we are given the $i_0$ and the $i_1$ required. The
universal mapping property has as a by-product that the “size” of $A$ is determined
by the sizes of $A_0$ and $A_1$, but it has many more detailed uses than the mere “size”
statement.

How does the general sum relate to 2? If we are given a decomposition of any
set $A$ as a sum of two other sets $A_0$ and $A_1$, we will automatically have a mapping
from $A$ to 2, namely, the mapping that takes the two values on those two respective
parts. That follows from our definition, for we could apply the universal mapping
property by taking $B = 2$ and consider the two elements (one called 0 and the other
called 1) of 2; each of those can be preceded by the unique map that always exists
from any set to the one-element set; i.e. we can take $f_0$ and $f_1$ to be, respectively,

$A_0$'s unique map to 1 followed by the element 0 of 2
$A_1$'s unique map to 1 followed by the other element 1 of 2

Now we have two maps, $f_0$ and $f_1$, going from $A_0$ and $A_1$ into the set 2, and so
there is a unique $f$ whose domain is the sum, whose codomain is 2, and such that
these composites are equal to the respective constant mappings just constructed, as
in the following commutative diagram:

[Diagram: $A_0 \to 1 \xrightarrow{0} 2$ and $A_1 \to 1 \xrightarrow{1} 2$, with $i_0$, $i_1$ and the unique $f : A \to 2$]

On the part $i_0$ of the sum, this mapping $f$ constantly takes the value 0; the other
composite is constantly the other element 1 of 2.

Conversely, any mapping $f : A \to 2$ really is of the preceding sort because, if we
are given the $f$, then we can define two parts of which $A$ is the sum. We can imagine
that each part is the one whose members are all the elements for which $f$ has a
fixed value. That is, we may construct mappings into 2 by other means and then
deduce the corresponding sum decomposition.



Anytime we have this situation, we may say A has been divided into two parts. 
But what is a part and what is membership? We will discuss these concepts in the 
next section. 
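
The passage between mappings into 2 and divisions into two parts can be sketched for a finite set as follows. The element names and the helpers `parts_of` and `indicator` are hypothetical, and the parts are modeled here simply as Python sets rather than as mappings.

```python
def parts_of(f):
    """Split the domain of f : A -> 2 into the two parts on which f is constant."""
    A0 = {a for a, v in f.items() if v == 0}
    A1 = {a for a, v in f.items() if v == 1}
    return A0, A1

def indicator(A0, A1):
    """The mapping into 2 taking the value 0 on A0 and 1 on A1."""
    assert not (A0 & A1), "the two parts must not overlap"
    return {**{a: 0 for a in A0}, **{a: 1 for a in A1}}

# A hypothetical mapping f : A -> 2 on a four-element set.
f = {"p": 0, "q": 1, "r": 1, "s": 0}
A0, A1 = parts_of(f)
rebuilt = indicator(A0, A1)
```

Going from the mapping to its two parts and back returns the original mapping, which illustrates the two-way correspondence described in the text.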

Exercise 2.5 

Define explicitly what should be meant by a sum of three given sets. If V is any 
fixed set with exactly three elements, show that any mapping A — »■ V comes from 
a unique sum decomposition of A into three parts. 0 
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2.2 Monomorphisms and Parts 


Monomapping, or monomorphism, is a stronger version of injective, which we will 
have to use to define part in general categories of sets and for other puiposes also: 

Definition 2.6: An arrow S — ->■ A is a monomapping or monomorphism if it has 

the cancellation property with respect to composition on its right, i. e. , 

VT V.S| , si : T — s~S |;A| — is2 => .s'i = 5?] 
so whenever we have the configuration 

T — 4- S— U-A 

■n 

and is\ — is 2 , then s \ = si. 

Equivalently, 


VT Vx [T — A =>• 3 at most one 5 [is = x]] 

The expression above is called a cancellation property because, given an equation 
involving composition with i on the left, we can cancel the i. Since p and S 2 
arc generalized elements, the property says that i is injective even on generalized 
elements. The difference between monomorphic and injective is that, for the “mono” 
property, we require the cancellation for all T . This does not matter in the case of 
abstract sets, where cancellation with the general T or with just T = 1 means the 
same thing (see Exercise 2.7). That “mono” implies injective is tautologous because 
a general statement always implies any of its special cases. The converse statement 
is not tautologous; it depends on the existence of sufficiently many elements. 
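
For finite sets the two conditions can be compared by brute force. The following is a minimal sketch, assuming sets are Python sets and mappings are dicts; `all_maps`, `compose`, `is_injective`, and `is_mono_for` are hypothetical helper names introduced here for illustration.

```python
from itertools import product


def all_maps(T, S):
    """All mappings T -> S between finite sets, each represented as a dict."""
    T = sorted(T)
    return [dict(zip(T, values)) for values in product(sorted(S), repeat=len(T))]


def compose(i, s):
    """The composite i s: first apply s, then i."""
    return {t: i[s[t]] for t in s}


def is_injective(i):
    """Cancellation tested only on elements (the case T = 1)."""
    return len(set(i.values())) == len(i)


def is_mono_for(i, S, T):
    """Cancellation tested on all pairs of generalized elements T -> S."""
    maps = all_maps(T, S)
    return all(s1 == s2
               for s1 in maps for s2 in maps
               if compose(i, s1) == compose(i, s2))


S = {0, 1, 2}
T = {"p", "q"}
i = {0: "x", 1: "y", 2: "z"}   # injective
c = {0: "x", 1: "x", 2: "z"}   # identifies the elements 0 and 1
```

On these examples the element-wise test and the test with the two-element T agree, as the text asserts for abstract sets; the check is of course no substitute for the proof requested in Exercise 2.7.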

Exercise 2.7 

Using the axiom that 1 separates mappings, show that 

injective $\Rightarrow$ monomapping

in the category of sets S. ◊

Exercise 2.8 

Show that if i and j are composable monomorphisms, then ji is a monomorphism,
i.e. a composite of monomorphisms is a monomorphism. (Recall that injective
mappings also compose.) ◊

Exercise 2.9

Show that any mapping $1 \to A$ whose domain is 1 is necessarily a monomapping. ◊

Exercise 2.10 

Show that if U has the property that the unique $U \to 1$ is a monomapping, then
any mapping $U \to A$ with domain U is necessarily a monomapping. ◊

The origin of these words (including the term epimorphic, seen later to be related 
to surjective) is in ancient Mediterranean languages as follows: 

GREEK:          LATIN:
monomorphic     injective
epimorphic      surjective

We exploit the differences in the words to get slightly different meanings. The dif- 
ference between injection and monomorphism is not too great in practice, whereas 
the difference between surjection and epimorphism (the dual of monomorphism, 
which we will define later) is greater. What we mean by monomapping or injective 
mapping is one that does not identify (in the sense of “make equal”) any elements 
that were previously distinct. 

We are now ready to define part. 

Definition 2.11: A part of set A is any mapping i for which 

(1) the codomain of i is A, and

(2) i is a monomapping. 

Any part of A has some domain, say S, but the property (2) means that the
part keeps the elements distinct: if we have $is_1 = is_2$, we can conclude $s_1 = s_2$. In
particular, if we return to the definition of sum, the injections into a sum have to be 
monomappings. 

Exercise 2.12 

Prove on the basis of the axioms for S so far introduced that if $A_0 \xrightarrow{i_0} A \xleftarrow{i_1} A_1$ is
a sum in S, then $i_0$ is a monomapping. ◊

Here is an example of the notion of part: There is the set A of chairs in a room 
and the set S of the students in the room. The mapping i assigns to every student 
the chair in which she or he sits. That is an injective mapping. Why? Because we 
have no situation of two students sitting in one seat. It is a monomorphism and 
therefore defines a part of the set of all seats. Students are not chairs - we have to
have a given mapping that specifies the way in which we have a part. The account
of “part of” given in most books would make no sense for abstract sets: It is not that
every element of S is an element of A! In our concrete example, the latter would
mean that every student “is” a chair. But i is indeed a part in the sense that we
want to use the term in mathematics; it does define a condition on chairs - to be a
member of the part means to be an occupied chair.

2.3 Inclusion and Membership 

To express the relationship between parts that arises when they can be compared, 
we have 

Definition 2.13: The part i is included in the part j (or there is an inclusion from
i to j) means that i and j are parts of the same A and that

$$\exists k\ [\, i = jk \,]$$

We write $i \subseteq_A j$ in this case and we sometimes omit the subscript A if the codomain
(“universe of discourse”) is obvious.

Exercise 2.14 

Show that if i is a monomorphism and i = fk, then k is a monomorphism. Give
an example to show that f need not be injective. ◊

Thus, the k whose existence is required in the definition is always a monomapping,
and so k is a part of the domain V of j.

Definition 2.15: The (generalized) element a of A is a member of the part i of A,
denoted $a \in_A i$, means that i is a monomapping and that

$$\exists \bar{a}\ [\, a = i\bar{a} \,]$$

(Diagram: $T \xrightarrow{\bar{a}} U \xrightarrow{i} A$ with composite a.)

Notice that we give the definition of membership for generalized elements. In
the special case T = 1 we are stating that an element a of A is a member of the part
i of A if there is an element $\bar{a}$ of the domain U of i that the part i interprets as a.
(Note that there can be at most one such $\bar{a}$.)
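
Membership in a part can be illustrated with the students-and-chairs example from Section 2.2. This is a sketch under stated assumptions: mappings are dicts, the one-element set 1 is modelled as `{"*"}`, and the names `witness`, `is_member`, and the sample data are hypothetical.

```python
def is_injective(i):
    """True when the mapping i (a dict) identifies no two elements of its domain."""
    return len(set(i.values())) == len(i)


def witness(a, i):
    """Return a_bar with a = i a_bar if one exists, else None (a: T -> A, i: U -> A)."""
    inverse = {value: key for key, value in i.items()}
    if not all(a[t] in inverse for t in a):
        return None
    return {t: inverse[a[t]] for t in a}


def is_member(a, i):
    """a is a member of the part i: i is a monomapping and a factors through i."""
    return is_injective(i) and witness(a, i) is not None


seating = {"ana": "c1", "ben": "c3"}   # the part i: students -> chairs
occupied = {"*": "c3"}                 # an element 1 -> A picking chair c3
vacant = {"*": "c2"}                   # an element 1 -> A picking chair c2
```

Here `witness(occupied, seating)` returns the student that the part interprets as chair c3, while the vacant chair has no witness and so is not a member of the part.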

Proposition 2.16: For any a, i and j (the “universe” A being understood),

$$a \in i \text{ and } i \subseteq j \ \Rightarrow\ a \in j$$

Proof: The hypothesis gives us $\bar{a}$ and k for which $a = i\bar{a}$ and $i = jk$. To establish
the conclusion we would need $\bar{\bar{a}}$ with $a = j\bar{\bar{a}}$. The only map at hand that even has
the right domain and codomain as such an $\bar{\bar{a}}$ is $k\bar{a}$; so define $\bar{\bar{a}} = k\bar{a}$
and see whether it satisfies the correct equation:

$$\begin{aligned}
j\bar{\bar{a}} &= j(k\bar{a}) \\
&= (jk)\bar{a} \\
&= i\bar{a} \\
&= a
\end{aligned}$$

Hence, $a = j\bar{\bar{a}}$ as required to prove that $a \in j$. ■
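
The proof is a single composition, which can be replayed on finite data. In this sketch mappings are dicts, `compose` is a hypothetical helper, and the particular sets and values are invented purely for illustration.

```python
def compose(g, f):
    """The composite g f: first apply f, then g."""
    return {t: g[f[t]] for t in f}


# Hypothetical finite data: j: V -> A, k: U -> V, and i = j k, so i is included in j.
j = {"v1": 1, "v2": 2, "v3": 3}
k = {"u1": "v1", "u2": "v3"}
i = compose(j, k)

# a_bar: T -> U proves that a = i a_bar is a member of i.
a_bar = {"t1": "u2", "t2": "u1"}
a = compose(i, a_bar)

# The proof of the proposition: k a_bar proves that a is a member of j.
a_bar_bar = compose(k, a_bar)
```

Composing j with `a_bar_bar` gives back a, which is exactly the chain of equalities in the proof (associativity is automatic for dict composition).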

Exercise 2.17 

If $a \in i$, then for all t, $at \in i$. ◊

In order to follow the next construction it will be useful to make explicit the 

logical rule of inference for implication-introduction. 

If X, Y, Z are statements, then

$(X \;\&\; Y) \Rightarrow Z$ holds
if and only if
$Y \Rightarrow (X \Rightarrow Z)$ holds

(For more details about the symbols, see Appendix C.2 and Appendix A.) 

For example, if one takes X to be the statement $a \in i$, Y to be $i \subseteq j$, and Z
to be $a \in j$, the proposition we have just proved is the statement with & in the
hypothesis. Thus, we could equally well say that we have proved the statement
with the $\Rightarrow$ inside the conclusion:

$$i \subseteq j \Rightarrow [\, a \in i \Rightarrow a \in j \,]$$

Note that the variable a now appears in the conclusion but not in the hypothesis.
We also need to make explicit the 

logical rule of inference for V-introduction. 

If Y and W are statements in which W depends on a variable a but Y does not,
then

$Y \Rightarrow W$ holds
if and only if
$Y \Rightarrow \forall a\,[W]$ holds

Thus, our proposition above is further equivalent to

$$i \subseteq_A j \Rightarrow \forall a\,[\, a \in i \Rightarrow a \in j \,]$$

where the statement on the right has the universal quantifier $\forall a$ ranging over all
$T \xrightarrow{a} A$ for all sets T. The converse of (this form of) the proposition

$$\forall a\,[\, a \in i \Rightarrow a \in j \,] \Rightarrow i \subseteq_A j$$

is also of interest. This converse is trivially true if the domain of a is arbitrary
since, for the special case a = i, $\bar{a} = 1_U$ proves $i \in i$; hence, if the hypothesis
$\forall a\,[\ldots]$ is true, we get a map proving that $i \in j$, but that is the same as a proof k
that $i \subseteq j$. However, the converse becomes more interesting if we restrict the range
of the a’s for which we assume the hypothesis, for then it could be a nontrivial
property of our category that the conclusion $i \subseteq j$ nonetheless still followed from
this weaker hypothesis. Something of this sort will be true of categories of variable
sets, where it will be possible to restrict a to the indispensable domains of variation
of the sets. In the case of abstract (= constant) sets there is no substantial
domain of variation, and so we can restrict the domain of a to be 1. That is, the
category of abstract sets satisfies the strengthened converse proposition (a weakened
hypothesis)

$$\forall a\,[\, 1 \xrightarrow{a} A \Rightarrow (a \in i \Rightarrow a \in j) \,] \Rightarrow i \subseteq j$$

Essentially, this follows from the idea of the arbitrariness of mappings: Note that
what is wanted in the conclusion is a single actual map k for which i = jk, yet the
hypothesis only says that

$$\forall\, 1 \xrightarrow{x} U\ \ \exists\, 1 \xrightarrow{y} V\ [\, jy = ix \,]$$

(Here the a is the common value jy = a = ix.) Since j is injective, the $\exists y$ can be
strengthened to $\exists! y$.

Thus, from the hypothesis we get

$$\forall x\ \exists! y\ [\, jy = ix \,]$$

The idea that mappings are arbitrary means that, whenever $\forall x\, \exists! y\,[\ldots]$ holds, then
there does exist a mapping k - in this case one for which

$$\forall x\ [\, j(kx) = ix \,]$$

Since there are “enough” $1 \xrightarrow{x} U$, we conclude that jk = i, as required.


We will soon be able to prove this strengthened converse on a more explicit basis 
that does not depend on the vague “arbitrariness of mappings” principle. 

Historically many mathematicians such as Dedekind and Banach used the same
symbol $\subseteq$ for both membership and inclusion, and indeed it is quite reasonable to
define x belongs to y for any two maps with codomain A and even to use the same
symbol for it

$x \subseteq y$ iff $\exists z\,[\, x = yz \,]$

where iff is the standard abbreviation for “if and only if”. 

If y is not monomorphic, then such a z may not be unique; such z are often 
called “liftings of x along y”. In the formalized set theory following Peano, the 
membership symbol was given a completely different meaning, but mathematical 
use of it has been as we defined; thus, we feel justified in using the membership 
symbol for this special case of the “belonging to” relation even though it is not 
strictly needed. The notion obtained by reversing the arrows 

$$\exists g\ [\, f = gh \,]$$

may be called f is determined by h and is at least as important in mathematics
and its applications. Here the g, which shows a way of determining f values from
h values, is often called an “extension of f along h”; such a g is only guaranteed to
be unique if h is “epimorphic” (the dual cancellation property). Epimorphic maps
with a given domain A are partitions (as opposed to parts) of A (see Section 4.4);
again the special case where both f and h are partitions rates special attention: in
that case the determining g is often called a “coarsening” of partitions (rather than
an including of parts).

Exercise 2.18 

If f is determined by h, then any further $\varphi f$ is also determined by h.

General belonging and determining often reduce to the special cases of belonging
to parts (i.e., membership) and, respectively, of determining by partitions. This
happens because frequently a map y can be expressed as $y = y_0 y_1$ where $y_0$ is
“mono” and $y_1$ is “epi”; then, if x belongs to y, it follows that x is a member of the
part $y_0$ of A:

(Diagram: $T \xrightarrow{z} C \xrightarrow{y} A$ with $x = yz$ and $y = y_0 y_1$.)

If z proves that x belongs to y, then $y_1 z$ proves the stated membership; conversely,
if $x \in y_0$, then in a sense x “locally” belongs to y.

Exercise 2.19 

Formulate the dual construction, showing that if f is determined by h and if h has
a “mono/epi” factorization $h = h_0 h_1$, then f is an invariant of the partition $h_1$ of
the domain. (In suitable categories the converse will say that if f is an invariant
of $h_1$, then some “expansion” of f is determined by h; the phrase “invariant of”
is frequently used in this context and is dual to “member of,” and we have used
the term “expansion” dually to a common use of the term “covering” to explain
“holding locally”.) ◊


2.4 Characteristic Functions 

Given a chosen set V thought of as “truth values” and a chosen constant element
$1 \xrightarrow{v_1} V$ thought of as “true,” we can define a relationship as follows:

Definition 2.20: For any set A, mapping i with codomain A, and mapping $A \xrightarrow{\varphi} V$,
$\varphi$ is an indicator or characteristic function of i if and only if

(1) i is monomorphic and

(2) for any T and any $T \xrightarrow{a} A$

$$a \in_A i \iff \varphi a = v_1 T$$

(For any $1 \xrightarrow{c} C$, $cT$ denotes the constant map

$$T \to 1 \xrightarrow{c} C$$

with domain T and value c).

If we refer to the element $v_1$ as “true,” then a is a member of i if and only if $\varphi a$
is “true.” Since that is true for all a, in particular

$$\varphi i = v_1 U \quad \text{where } U = \mathrm{dom}(i)$$

moreover, for any a, if $\varphi a = v_1 T$, then there is an $\bar{a}$ for which $a = i\bar{a}$.



Exercise 2.21 

For example, if A = V and $i = v_1$, then an indicator of $v_1$ as required above is
$\varphi = 1_V$. ◊

The exercise shows that (1) $v_1$ is monomorphic, and (2) for any T and any
$T \xrightarrow{a} V$

$$a \in_V v_1 \iff 1_V a = v_1 T$$

The idea of the indicator, then, is that the general membership relation can be
represented by this particular case. An important fact about the category of abstract
sets is that for V = 2 and the chosen element $1 \to 2$ we have unique characteristic
functions for all parts of all sets. That is,

AXIOM: MEMBERSHIP REPRESENTATION VIA TRUTH VALUES

(1) Any $A \xrightarrow{\varphi} 2$ is the characteristic function of a part of A.

(2) Any part of A has a unique 2-valued characteristic function.

For categories of cohesive and variable sets there will still be a $V \neq 2$ satisfying
properties (1) and (2), but that the set 2 has these two properties is special to the
category S and those special categories of variable sets called Boolean toposes.

The preceding principle will have a great many uses. It enables us, for example, 
to count parts by counting 2-valued functions instead. Recall that counting requires 
a clear notion for distinguishing the things being counted. 
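
For a small finite set the counting principle can be checked directly. The sketch below assumes that parts are represented up to equivalence by subsets (frozensets) and 2-valued functions by dicts with values 0 and 1; the helper names are hypothetical.

```python
from itertools import product


def characteristic_functions(A):
    """All mappings A -> 2 for a finite set A, each represented as a dict."""
    A = sorted(A)
    return [dict(zip(A, values)) for values in product((0, 1), repeat=len(A))]


def part_of(phi):
    """The subset classified by phi, taking the value 1 as 'true'."""
    return frozenset(a for a, value in phi.items() if value == 1)


A = {"a", "b", "c"}
phis = characteristic_functions(A)
parts = {part_of(phi) for phi in phis}
```

Distinct 2-valued functions classify distinct subsets, so for a three-element set both counts come out to eight.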

Proposition 2.22: The following are equivalent for parts $U \xrightarrow{i} A$ and $V \xrightarrow{j} A$ of
a set A:

(1) $i \subseteq_A j$ & $j \subseteq_A i$,

(2) $\exists h, k\ [\, j = ih \;\&\; i = jk \;\&\; hk = 1_U \;\&\; kh = 1_V \,]$, and

(3) $\varphi = \psi$, where these are the characteristic functions of i and j, respectively.

Exercise 2.23 

Prove the proposition, i.e. show that each of (1), (2), and (3) implies the others. 

Hint: (2) obviously implies (1), but to see the converse remember that i and j are
parts. ◊

Definition 2.24: Parts i and j of a set A are equivalent, which we write $i \equiv_A j$, if
and only if $i \subseteq_A j$ & $j \subseteq_A i$.

Thus, characteristic functions do not distinguish equivalent parts, and parts are 
distinguished for counting purposes only if they are not equivalent. Indeed, we will
sometimes abuse language by writing equivalent parts as equal. However, even if 
two parts have the same number of elements in the sense we will describe soon in 
Section 3.2, they need not be equivalent. 

Exercise 2.25 

If i and j are parts of A and their mere domains are isomorphic, meaning that
there are h, k satisfying $hk = 1_U$ & $kh = 1_V$, then neither part need be included
in the other. However, if k also proves that $i \subseteq_A j$, then it follows that $i \equiv_A j$ as
parts. ◊


2.5 Inverse Image of a Part 

A frequent use of parts is in restriction of mappings. 

Definition 2.26: If $X \xrightarrow{f} Y$ and if i is a part of X, then the composite fi
is called the restriction of f to the part i (often denoted $f|_i$ or $f|_U$, where U is the domain of i).

Exercise 2.27 

Give an example of a surjective f and of a part of its domain such that the restriction
is not surjective. Give an example of f that is not injective but such that a restriction
of f is injective. ◊

For example, if f describes the temporal variation of temperature throughout
a year and if j is the subzero part of the temperature values, then there can be a
certain week i in January such that $fi \in j$.

Exercise 2.28 

Monomaps are only left-cancellable and are not usually right-cancellable. Give an
example of distinct f, g with equal restrictions to the same part. ◊

An operation of fundamental importance in geometry, analysis, logic, and set 
theory is that of forming the 

INVERSE IMAGE OF A PART ALONG A MAP 

which also goes under names like “substitution” or “pullback” in certain contexts. 
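Thus, if we are given an arbitrary map f from X to Y and an arbitrary part $V \xrightarrow{j} Y$
of the codomain, then there is a part $U \xrightarrow{i} X$ of the domain such that

(0) for all $T \xrightarrow{x} X$

$$x \in i \iff fx \in j$$

This is easily seen to imply that

(1) there is a commutative square

$$\begin{array}{ccc}
U & \xrightarrow{\ \bar{f}\ } & V \\
{\scriptstyle i}\downarrow & & \downarrow{\scriptstyle j} \\
X & \xrightarrow{\ f\ } & Y
\end{array}$$

and that, moreover,

(2) any commutative square

$$\begin{array}{ccc}
T & \xrightarrow{\ y\ } & V \\
{\scriptstyle x}\downarrow & & \downarrow{\scriptstyle j} \\
X & \xrightarrow{\ f\ } & Y
\end{array}$$

can be expressed in terms of (1) by means of a common $\bar{x}$:

$$x = i\bar{x} \quad \& \quad y = \bar{f}\bar{x}$$

In other words, f restricted to the part i has its image in j, and i is the largest
part of X with that property.

(3) The $\bar{x}$ is unique (because we have assumed that i is a monomapping).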
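
For finite sets the inverse image can be computed directly from condition (0). This is a sketch under stated assumptions: the map f is a dict, the part j is represented by a subset V of the codomain, and `inverse_image` is a hypothetical helper name.

```python
def inverse_image(f, V):
    """Elements x of the domain of f with f(x) in V, where the subset V stands in for the part j."""
    return {x for x in f if f[x] in V}


f = {1: "a", 2: "b", 3: "a", 4: "c"}   # a map X -> Y
V = {"a", "c"}                         # a part of Y, as a subset
U = inverse_image(f, V)                # the part of X with x in U iff f(x) in V
f_bar = {x: f[x] for x in U}           # top edge of the commutative square
```

Every element of U is sent into V, and every element outside U is sent outside V, which is the sense in which U is the largest part of X that f carries into V.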


Exercise 2.29 

(a) Show that (0) $\Rightarrow$ (1), (2), and (3).

(b) Show that (1), (2), (3), and j is mono $\Rightarrow$ i is mono.

(c) Show that (1), (2), and (3) $\Rightarrow$ (0).

(d) Show that for given f, j, if both $i_1$, $i_2$ satisfy (0), then $i_1 \equiv_X i_2$ as parts of X. ◊


The part i described above is called the inverse image of j along f. The operation
of inverse image has the following contravariant functoriality property: 


Exercise 2.30 

If j is the inverse image of k along g and if i is the inverse image of j along f,
then i is the inverse image of k along gf.

(Diagram: two adjacent inverse-image squares, with the parts i, j, k standing over the
composable maps f and g.) ◊



When there is a truth-value object V (such as the set 2 in the case of abstract sets), 
i.e. an object “classifying” all parts by means of unique characteristic functions, 
then inverse image is so intimately related to composing that the functoriality above 
translates into a case of associativity (see Exercise 2.34). 

Proposition 2.31: $Y \xrightarrow{\psi} V$ is the characteristic function of a part j if and only if
j is the inverse image along $\psi$ of the one-element part $1 \xrightarrow{v_1} V$ of V.

Exercise 2.32

Prove Proposition 2.31.

Hint: Apply the definition of indicator. ◊

Theorem 2.33: If $\psi$ is the characteristic function of a part j, then for any f, $\psi f$
is the characteristic function of the inverse image of j along f.


Proof: Apply the functoriality of inverse image to the special case indicated. ■
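
The theorem can be checked on a finite example by composing dicts. This sketch assumes V = 2 with 1 as “true,” represents parts by subsets, and uses hypothetical helper names (`compose`, `char_fn`, `inverse_image`) and invented sample data.

```python
def compose(g, f):
    """The composite g f: first apply f, then g."""
    return {x: g[f[x]] for x in f}


def char_fn(Y, V):
    """The 2-valued characteristic function on Y of the subset V."""
    return {y: 1 if y in V else 0 for y in Y}


def inverse_image(f, V):
    """Elements x of the domain of f with f(x) in V."""
    return {x for x in f if f[x] in V}


X = {1, 2, 3, 4}
Y = {"a", "b", "c"}
f = {1: "a", 2: "b", 3: "a", 4: "c"}
V = {"a", "c"}

psi = char_fn(Y, V)       # characteristic function of the part V of Y
psi_f = compose(psi, f)   # the composite, a 2-valued function on X
```

On this data the composite `psi_f` coincides with the characteristic function of the inverse image of V along f.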




Exercise 2.34 

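Interpret $\eta(gf) = (\eta g)f$ in terms of the inverse images:

(Diagram: the composable maps f, g, and $\eta$, the last with codomain 2, together with
the corresponding inverse-image parts.) ◊
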


An important special case of inverse image is intersection, which is what results 
from inverse image along a map that is itself a part.


Exercise 2.35 

The composite of monomaps is again such. Hence, if both f, j are monomaps and
i is the inverse image of j along f, then $m = fi = j\bar{f}$ is also a part, and for any y,

$$y \in m \iff [\, y \in j \;\&\; y \in f \,]$$

◊

Definition 2.36: For parts i and j with the same codomain Y, a part m of Y is
called their intersection if and only if for all T, for all $T \xrightarrow{y} Y$

$$y \in m \iff [\, y \in i \;\&\; y \in j \,]$$

The m so defined is usually denoted $m = i \cap j$.

Remark 2.37: Exercise 2.35 proves that the inverse image of a part j along a 
monomapping f is the intersection of the two parts j and f.

Exercise 2.38 

Prove that we have $j_1 \cap j_2 = j_2 \cap j_1$. ◊

Inverse images along arbitrary maps preserve intersection of parts in the following 
sense: 

Exercise 2.39 

If $X \xrightarrow{f} Y$ and if $j_1$, $j_2$ are any two parts of Y, and if $i_k$ is the inverse image of $j_k$
along f for k = 1, 2, then the inverse image of $j_1 \cap j_2$ along f is $i_1 \cap i_2$, where
the latter indicates intersection of parts of X. Briefly, $f^{-1}(j_1 \cap j_2) = f^{-1}(j_1) \cap f^{-1}(j_2)$,
where we have exploited the essential uniqueness of the inverse image
operation to justify introducing a notation $f^{-1}(\ )$ for it. ◊


2.6 Additional Exercises 

Exercise 2.40 

The category of vector spaces and linear transformations has sums, but they are 
very different from the sums in S. Let V and V’ be vector spaces. 

(a) Show that the vector space $V \times V'$ of ordered pairs consisting of a vector from
V and one from V' (specify how this is a vector space!) is the sum.

Hint: The injection $V \to V \times V'$ sends v to (v, 0).

(b) Show that monomorphisms are injective linear transformations (i.e., those with
kernel 0). Thus, a subspace of a vector space determines a monomorphism to
the vector space (part of the space).

Recall that an object 1 that satisfies the axiom of the one-element set in a category 
(i.e., for any object there is a unique map to 1) is called a terminal object of the 
category. Dually, an object 0 that satisfies the axiom of the empty set in a category 
(i.e., for any object there is a unique map from 0) is called an initial object of the 
category. 





(c) Show that the zero vector space is simultaneously a terminal object and an 
initial object (such an object is called a zero object). 

(d) Show that any part of a vector space has an inverse image along any linear
transformation. Remember to verify that the inverse image of the mapping is a 
vector space. 

Exercise 2.41 

(a) Show that injective group homomorphisms are monomorphisms in the category
of groups and group homomorphisms.

(b) Show that the one-element group is a zero object.

The category of groups also has sums, but their construction is rather complicated -
they are sometimes called “free products”.

(c) Show that any part (= subgroup) of a group has an inverse image along any
group homomorphism.

Exercise 2.42 

(a) Show that the category of partially ordered sets has sums. 

Hint: The sum partial order is on the sum of the two sets involved. Elements
are related in the sum exactly if they are related in a component of the sum of
sets.

(b) Show that monomorphisms in the category of partially ordered sets are injective
monotone mappings.

(c) Show that the one-element set and the empty set with the only possible order 
relations are the terminal and initial ordered sets. 

Exercise 2.43 

An important case of slice categories (see Exercise 1.30(e)) is the category of X- 
indexed families of abstract sets S/X. Recall that in S/X objects are mappings 
with codomain X and arrows are commutative triangles. The name “family” arises 
as follows: For any object $A \xrightarrow{f} X$ of S/X and any element $1 \xrightarrow{x} X$ of X the
inverse image of x along f is a part of A denoted $A_x$ and is called “the fiber of
A over x;” A is the “sum” of the family of all of its fibers. This is a very simple
example of a variable set. 

(a) Show that the category S/X has binary sums that are computed “fiberwise”. 

(b) Show that monomorphisms in S/X are also “fiberwise” and have characteristic
morphisms taking values in the object $\Omega$ of S/X, which has each fiber equal
to 2.

(c) Show that the terminal object in S/X is $X \xrightarrow{1_X} X$, and the initial object is the
unique $0 \to X$.

(d) Show that S/X has inverse images computed using the inverse images in S. 

Exercise 2.44 

(a) What are the parts of the terminal object in S/X (see the previous exercise)?
Show a one-one correspondence between them and elements of the object $\Omega$
above (i.e., arrows from the terminal object to $\Omega$).

(b) Show that the terminal object in S/X is not in general a separator. However,
show that for any parallel pair of different maps in S/X there is a map from a
part of the terminal object to the common domain that distinguishes the parallel
pair.

Exercise 2.45 

A special case of the preceding examples occurs when X = 2. Here 2 has the
elements 0 and 1, and so each object of S/2 is just the sum of its two fibers. A
related category is the “category of pairs of sets,” which we denote $S \times S$. The
objects of this latter category are pairs written $(A_0, A_1)$ of objects of S. An arrow
from $(A_0, A_1)$ to $(B_0, B_1)$ is a pair of mappings $A_0 \to B_0$, $A_1 \to B_1$.

(a) Show that $S \times S$ is a category.

(b) Show that any object $(A_0, A_1)$ of $S \times S$ defines a unique object of S/2, and
conversely. Show that any arrow of $S \times S$ defines a unique arrow of S/2, and
conversely.

(c) Show that the correspondences in (b) respect identities and composition and
have the property that when their actions on objects are “composed” with each
other the resulting object is isomorphic to the starting object. In this situation,
we say that $S \times S$ and S/2 are equivalent as categories.

Exercise 2.46 

(a) Show that the category of pointed sets 1/S (see Exercise 1.30(f)) has sums.

Hint: The sum of A, a and B, b is the set formed by taking the sum of the sets
A and B and then merging the elements a and b in A + B. Thus, the sum of a
two-element pointed set and three-element pointed set has how many elements?

(b) Describe monomorphisms in 1/S.

(c) Show that 1/S has inverse images.

Exercise 2.47

Let $\mathbf{X}$ be the category defined by an ordered set X (see Exercise 1.31(b)).

(a) Show that objects x, x' of $\mathbf{X}$ (elements of X!) have a sum exactly when they
have a least upper bound.

(b) Show that any arrow of $\mathbf{X}$ is a monomorphism.

(c) Show that a greatest element of X is a terminal object in $\mathbf{X}$, whereas a smallest
element of X is an initial object.

(d) Show that a part of x (i.e., $u \le x$) has an inverse image along an arrow to x
(i.e., $x' \le x$) if and only if u and x' have a greatest lower bound.

3.1 Retractions

We begin this chapter by observing an important condition which implies that a 
mapping is a monomapping, namely, that it has a retraction. 

Definition 3.1: For mappings r and i in any category, r is a retraction for i means 
that ri is an identity mapping. In the same situation we say that i is a section for r. 

To see more clearly what this says about domains and codomains, let $X \xrightarrow{i} Y$;
then, we see that r must have domain Y (in order that ri be defined) and codomain
X (in order that ri could possibly be an identity mapping, necessarily that on X).

$$X \underset{r}{\overset{i}{\rightleftarrows}} Y \qquad ri = 1_X$$

For a given mapping i , there may or may not be a retraction r for i , and there 
might be many. (Similarly, a mapping r need not have any sections.) But to have 
even one retraction is a strong condition: 

Proposition 3.2: If i has at least one retraction r, then i is a monomapping (=
left-cancellable mapping, which we have seen is the same as injective mapping).

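Proof: We must show that for any $x_1$, $x_2$ as shown

$$T \underset{x_2}{\overset{x_1}{\rightrightarrows}} X \xrightarrow{\ i\ } Y$$

that $ix_1 = ix_2 \Rightarrow x_1 = x_2$.

So suppose $ix_1 = ix_2$. Then,

$$\begin{aligned}
r(ix_1) &= r(ix_2), && \text{so associativity of composition gives} \\
(ri)x_1 &= (ri)x_2, && \text{but } ri \text{ being an identity map, this is} \\
x_1 &= x_2 && \text{as desired}
\end{aligned}$$
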


(Note how closely this proof parallels the sort of calculation done in elementary
algebra: Suppose we know $7x_1 = 7x_2$; then,

$$\begin{aligned}
\tfrac{1}{7}(7x_1) &= \tfrac{1}{7}(7x_2) \\
(\tfrac{1}{7} \cdot 7)x_1 &= (\tfrac{1}{7} \cdot 7)x_2 \\
1x_1 &= 1x_2 \\
x_1 &= x_2
\end{aligned}$$

The construction in our proof is analogous to division by 7; i.e. to multiplication
by a multiplicative inverse for 7. The only new observation is that $\tfrac{1}{7}$ does not really
need to be an inverse for 7, that is, to satisfy both $\tfrac{1}{7} \cdot 7 = 1$ and $7 \cdot \tfrac{1}{7} = 1$; only the first
of these equations was used - it is enough to have a left-inverse to be able to cancel
a factor on the left.)

Notice that this calculation gives more than merely knowing that i can be 
cancelled; having r provides a specific reason, or proof, of the left-cancellability of i . 

A mapping i that has a retraction is also called a split injection or split mono- 
mapping. Thus, 

Definition 3.3: The arrow $i : X \to Y$ is a split injection if and only if

$$\exists r : Y \to X\ [\, ri = 1_X \,]$$

Such a mapping r is also sometimes called a “splitting for i” instead of “retraction
for i” or “left-inverse for i”. Here is a picture with internal diagrams (since this is
a monomapping, it is already a part of Y):

(Figure: internal diagrams of X and Y, with i including X as a part of Y.)

Now for r to be a retraction from Y back to X along i, $ri = 1_X$, we see that for
any element x

$$r(ix) = (ri)x = x$$

so any element y of Y that came from an x must be sent by r back to where it
came from, and that is all the equation $ri = 1_X$ says. It says nothing about the other
elements of Y, if there are any; but of course, to be a mapping $Y \to X$, r must be
defined for all those other elements of Y as well, and r must send them somewhere
in X (it does not matter where we send them, and so there will usually be many
possibilities for the mapping r).


(Figure: internal diagrams of X and Y in which the values of r on the elements of Y
outside the part are still to be chosen, followed by one possible choice of r.)

For example, if i is the mapping
assigning to each student the chair in which that student is sitting, r must assign
a student to every chair, and the equation says that to each occupied chair must
be assigned the student who is occupying it. But for the other chairs we can make
any definite decision we want; for instance, r could assign all the unoccupied chairs
to one particular student, or any other rule could be adopted for the unoccupied
chairs and would nonetheless complete the definition of yet another retraction r for
our given i.

We hope this example will show that, in the category of abstract sets, almost every 
monomapping has at least one retraction: just pick any element of the domain to be 
the recipient of all the extra elements of the codomain. The only difficulty arises
when the domain is empty and the codomain is not: the (unique) map

$$0 \to Y$$

is a monomapping but is not split if $Y \neq 0$.

Claim 3.4: In the category of abstract sets and mappings, if $X \xrightarrow{i} Y$ is a
monomapping with domain X not empty, then i has a retraction.

To prove the claim we will use two properties of the category of abstract sets and 
arbitrary mappings whose precise relation to our axioms will be established later 
in Section 6.1: 

(1) If $X \neq 0$, then X has an element $1 \xrightarrow{x} X$;

(2) every part $X \xrightarrow{i} Y$ has a complement $X' \xrightarrow{i'} Y$, i.e.

$$X \xrightarrow{\ i\ } Y \xleftarrow{\ i'\ } X'$$

is a sum diagram.

Now we proceed with Claim 3.4; X is not empty, so let $1 \xrightarrow{x} X$ be an element and
$X' \xrightarrow{i'} Y$ a complement of i. The defining property of sum says that to define any
mapping $Y \to X$ (in particular, the r we are looking for) is equivalent to specifying
mappings from the two parts whose sum is Y. We use the mappings we have,

(Diagram: the identity $1_X : X \to X$ above, and the composite $X' \to 1 \xrightarrow{x} X$ below,)

where the bottom arrow is the only mapping $X' \to 1$. Now the universal mapping
property of sum tells us that there will be a mapping $Y \xrightarrow{r} X$, making the diagram
commute as follows:

$$ri = 1_X, \qquad ri' = \left( X' \to 1 \xrightarrow{x} X \right)$$

Of course, we do not really need the commutativity of the bottom triangle, for the
top triangle says $ri = 1_X$, which is all we wanted. ■
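
The definition by cases used in the proof can be mirrored for finite sets. The sketch below assumes mappings are dicts and the complement X' is computed as a set difference; `retraction_by_cases` and the student/chair data are hypothetical names chosen to match the earlier example.

```python
def retraction_by_cases(i, Y, x):
    """Build r: Y -> X with r i = identity, defined by cases on the sum Y = X + X'."""
    image = set(i.values())
    if len(image) != len(i):
        raise ValueError("i must be injective")
    if x not in i:
        raise ValueError("x must be an element of the domain X of i")
    complement = set(Y) - image            # X', the complement of the part i
    r = {i[s]: s for s in i}               # on the part i: undo i (top triangle)
    r.update({y: x for y in complement})   # on X': the constant map X' -> 1 -> X
    return r


seating = {"ana": "c1", "ben": "c3"}       # i: students -> chairs
chairs = {"c1", "c2", "c3", "c4"}
r = retraction_by_cases(seating, chairs, x="ana")
r_other = retraction_by_cases(seating, chairs, x="ben")
```

Both `r` and `r_other` send each occupied chair back to its occupant; they differ only on the unoccupied chairs, which illustrates why a monomapping with nonempty domain usually has many retractions.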

A proposition of this type is definitely not true in other categories with more geo- 
metrical content. Examples of such categories are in the next exercise. We discuss 
additional examples below in Exercise 8.11.

Exercise 3.5 

The category C has as objects those parts of $\mathbb{R}^n$, for various n, which are open. The
arrows of C are mappings between the objects that are continuous. (For definitions
of open and continuous see any advanced calculus text.) Show that C is a category,
that is, define composition and identities and check the axioms. A subcategory V
of C is defined when we restrict to mappings that have a derivative. What rule of
elementary calculus shows that V has composition? A subcategory of V is obtained
by restricting to the smooth mappings - those that have all derivatives. ◊

In the category C, even though the sphere is a retract of punctured space (the
open part of $\mathbb{R}^n$ obtained by removing a single point), it is not a retract of the whole
(unpunctured) space. It is important to look at examples like this to get a better idea
about what part of our reasoning can be used in these more general situations without
change (for instance $ri = 1_X$ always implies i is a monomapping) and what part
is special to the category of abstract sets S (for instance i is a monomapping with
nonempty domain implies $\exists r\,[\, ri = 1 \,]$). Let us describe the example more precisely:

In n-dimensional space (e.g., n = 2, the plane) we have the (n - 1)-dimensional
sphere (i.e., the circle, if n = 2).

Claim 3.6: There does not exist any continuous retraction for the inclusion:
(n - 1)-dimensional unit sphere $\hookrightarrow$ n-dimensional space, centered at the origin.

(Figure: the case n = 2, with X the unit circle included in the plane Y.)


This claim is proved in analysis but is reasonable on the basis of the intuition of 
continuity. On the other hand, if we change the situation only slightly and replace 
the codomain Y by the “punctured plane” with the origin removed, then there is a 
fairly obvious continuous retraction: just send each point in the punctured plane to
the unique point on the circle that lies on the same ray from the origin.

If our circle has radius 1, the formula for r is

$$r(y_1, y_2) = \left( \frac{y_1}{|y|}, \frac{y_2}{|y|} \right)$$

where $|y| = \sqrt{y_1^2 + y_2^2}$ is the distance from the point $y = (y_1, y_2)$ in the plane to
the origin.
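
The formula for r can be tried out numerically. This is a small sketch; the function name `radial_retraction` is hypothetical, and raising an error at the origin is simply one way to reflect that r is defined only on the punctured plane.

```python
import math


def radial_retraction(y1, y2):
    """Send a point of the punctured plane to the unit-circle point on the same ray."""
    norm = math.hypot(y1, y2)   # |y|, the distance from (y1, y2) to the origin
    if norm == 0.0:
        raise ValueError("r is defined only on the punctured plane, not at the origin")
    return (y1 / norm, y2 / norm)


p = radial_retraction(3.0, 4.0)    # a point at distance 5 from the origin
q = radial_retraction(0.0, -1.0)   # a point already on the unit circle
```

Points already on the circle are left where they are, which is the equation ri = 1 for this r; no value at the origin would make the extended map continuous, in line with Claim 3.6.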

We could also have considered as codomain a unit ball rather than the whole 
Euclidean space with the same result: that the unpunctured version is completely 
different from the punctured version with respect to the question of continuously 
retracting onto the sphere. 


Exercise 3.7 

Suppose $S \xrightarrow{i} B \xrightarrow{j} E$ are maps in any category. Then i has a retraction if ji
does. ◊


That our proof of the ubiquity of retractions for abstract sets does not work for
continuous sets can be understood more exactly if we recall that it was based on
the concept of sum and still more exactly on the concept of complementary part.
Although sums exist generally in the continuous category, it is emphatically not the
case that to every part $X_0 \xrightarrow{i_0} Y$ there is another part $X_1 \xrightarrow{i_1} Y$ together with
which it sums to Y. Indeed, we might take $X_1$ as the “rest of” Y after removing $X_0$,
but the defining property of sum (that any pair of maps $f_0 : X_0 \to Z$, $f_1 : X_1 \to Z$
can be combined via definition by cases to give a single map $Y \to Z$) will fail in
the continuous category C: even if $f_0$, $f_1$ were both continuous, their combination
on Y would not be unless a further condition were verified. For example, if $X_0$ is the
circle included by $i_0$ as the boundary of a disc Y and if $i_1$ is the “rest” (i.e., the open
disk included by $i_1$ as everything in Y except the boundary), then for continuity of
the “sum” map one needs not only continuity of $f_0$, $f_1$ separately but also that

$\lim_{n \to \infty} f_1(y_n) = f_0(x)$

whenever $y_n$ is a sequence in the open interior $i_1$, which tends to x on the boundary.
These matters are discussed more fully in courses in analysis and geometry.

3.2 Isomorphism and Dedekind Finiteness 

“Equations” between sets, such as the statement that a set is the sum of two others 
or that it is the inverse image of a part along a map (and many others that we will
meet soon such as the commutative law for products, the laws of exponents, the 
distributive law, etc.) are not really equations but rather statements that certain sets 
have the same number of elements. More precisely, the intended statements often say 
more: a specific demonstration of the equinumerosity is stated to be available. Such a 
demonstration is a given isomorphism in the category of abstract sets and mappings. 

Definition 3.8: $X \xrightarrow{f} Y$ is an isomorphism (or is invertible) if and only if there
exists a $Y \xrightarrow{g} X$ for which both equations $gf = 1_X$ and $fg = 1_Y$ are true. Such
a g is called a (two-sided) inverse for f.

We obviously have the implications 

f isomorphism
    $\Downarrow$ A
f has a retraction and f has a section
    $\Downarrow$ B
f is injective and f is surjective
    $\Updownarrow$ Def
f is bijective

But we claim that implication A is reversible in any category and that implication 
B is reversible in the category of abstract sets and arbitrary maps; thus, in this 
category the isomorphisms are the same as the bijections (another name for which
is one-to-one correspondences).

The reversal of implication A above means that, even though retractions (= left
inverses) are not unique, and neither are sections (= right inverses), nonetheless if
both exist for the same f, then they are all equal:
Proposition 3.9: In any category, if $f : X \to Y$ and $r, s : Y \to X$, then

A* : $rf = 1_X$ and $fs = 1_Y \Rightarrow r = s$

C : $\exists r\,[rf = 1_X] \Rightarrow \forall s_1, s_2\,[fs_1 = 1_Y \text{ and } fs_2 = 1_Y \Rightarrow s_1 = s_2]$

Proof: On the assumption of the hypothesis of A*,

$r = r 1_Y = r(fs) = (rf)s = 1_X s = s$

proving the conclusion of A*. It should be clear that A* implies the converse of A.
Because A* is known, the proof of C requires us only to notice that C is equivalent
to

$\forall s_1, s_2\, \forall r\,[rf = 1_X \text{ and } fs_1 = 1_Y \text{ and } fs_2 = 1_Y \Rightarrow s_1 = s_2]$

which follows from two applications of A* together with the remark that

$r = s_1$ and $r = s_2 \Rightarrow s_1 = s_2$


Stated in words, C says that any two sections of f are equal provided that f has
at least one retraction; the proviso is certainly needed since most maps that have
sections have more than one.
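For small finite sets, the statements A* and C can be checked by listing every candidate map. The sketch below makes assumptions that are not in the text: maps are represented as Python dicts, and the helper names `all_maps`, `retractions`, `sections` and the three sample maps are hypothetical.

```python
from itertools import product

def all_maps(dom, cod):
    """Every map dom -> cod, each represented as a dict."""
    dom = list(dom)
    return [dict(zip(dom, values)) for values in product(list(cod), repeat=len(dom))]

def retractions(f, dom, cod):
    """All r : cod -> dom with r f = identity on dom."""
    return [r for r in all_maps(cod, dom) if all(r[f[x]] == x for x in dom)]

def sections(f, dom, cod):
    """All s : cod -> dom with f s = identity on cod."""
    return [s for s in all_maps(cod, dom) if all(f[s[y]] == y for y in cod)]

X, Y, Z = [0, 1], ["a", "b", "c"], ["p", "q"]
mono = {0: "a", 1: "b"}            # X -> Y, injective but not surjective
epi = {"a": 0, "b": 1, "c": 1}     # Y -> X, surjective but not injective
iso = {0: "p", 1: "q"}             # X -> Z, bijective

mono_retractions = retractions(mono, X, Y)   # several retractions, no section
epi_sections = sections(epi, Y, X)           # several sections, no retraction
iso_retractions = retractions(iso, X, Z)     # exactly one, equal to the section
iso_sections = sections(iso, X, Z)
```

In this example the non-uniqueness disappears exactly when both a retraction and a section exist, as A* predicts.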

The reversibility of implication B above, namely that any bijection has a two- 
sided inverse, does not hold in some categories (such as that of continuous maps), 
but it does hold in case the inverse is allowed to be an “arbitrary” map. One way to 
prove it would be to assume that every surjective f has at least one section s (this
is the axiom of choice; see Section 4.3) and then use the following generally valid 
proposition. 

Proposition 3.10: If $fs = 1_Y$ and if f is a monomorphism, then s is the two-sided
inverse of f.

Proof: We are supposed to prove that $sf = 1_X$, but as in other cases when we can
hope to use a cancellation property, we first prove something a little longer,

$f(sf) = (fs)f = 1_Y f = f = f 1_X$

from which the conclusion follows by left-cancellation of f.
However, that bijection implies isomorphism does not really depend on a powerful
principle such as the axiom of choice; the latter is usually only required when
the map g to be constructed is not unique and hence must be chosen arbitrarily. But
the bijectivity of f says exactly that

$\forall y\, \exists! x\,[fx = y]$

the existence of x coming from the surjectivity of f and the uniqueness from
the injectivity of f. Hence, by the concept of “arbitrary map” there should be a
well-defined map g such that

$gy$ = the x for which $[fx = y]$

for all y; then of course $gfx = x$ for all x and $fgy = y$ for all y so that g is the
two-sided inverse of f because 1 is a separator.
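The point that no choice is involved can be illustrated on finite sets, where "the x for which fx = y" is found by search. This is a sketch under the assumption that maps are dicts; the name `inverse_of_bijection` and the sample data are hypothetical.

```python
def inverse_of_bijection(f, X, Y):
    """Return g : Y -> X with g f = 1_X and f g = 1_Y; raise if f is not bijective."""
    g = {}
    for y in Y:
        preimages = [x for x in X if f[x] == y]
        if len(preimages) != 1:
            raise ValueError(f"{y!r} has {len(preimages)} preimages, so f is not bijective")
        g[y] = preimages[0]  # "the x for which f x = y": determined, not chosen
    return g

X = [0, 1, 2]
Y = ["a", "b", "c"]
f = {0: "b", 1: "c", 2: "a"}
g = inverse_of_bijection(f, X, Y)
```

Each value of g is forced by f, which is why the construction never has to pick among alternatives.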

Notation: If $gf = 1_X$ and $fg = 1_Y$, we write $f^{-1} = g$.

Proposition 3.11: If $X \xrightarrow{f} Y$ is an isomorphism, then $Y \xrightarrow{f^{-1}} X$ is also an
isomorphism.

Proof: Writing the two equations $gf = 1_X$, $fg = 1_Y$ in the opposite order, we see
that $f = g^{-1}$, i.e. that $f^{-1}$ has f as a two-sided inverse. ■

Proposition 3.12: If each of $f_1$, $f_2$ is an isomorphism and if $f_2 f_1$ makes sense,
then $f_2 f_1$ is also an isomorphism.

Proof: Let $g_1$, $g_2$ be two-sided inverses

$X \underset{g_1}{\overset{f_1}{\rightleftarrows}} Y \underset{g_2}{\overset{f_2}{\rightleftarrows}} Z$

The only obvious candidate for the inverse of the composite $f = f_2 f_1$ is the composite
in opposite order $g = g_1 g_2$ of the inverses. Indeed

$gf = (g_1 g_2)(f_2 f_1) = g_1 (g_2 f_2) f_1 = g_1 1_Y f_1 = g_1 f_1 = 1_X$
$fg = (f_2 f_1)(g_1 g_2) = f_2 (f_1 g_1) g_2 = f_2 1_Y g_2 = f_2 g_2 = 1_Z$





In other words,

$(f_2 f_1)^{-1} = f_1^{-1} f_2^{-1}$

is the required two-sided inverse. ■
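The order reversal in the inverse of a composite can be seen on a small example. The sketch assumes maps are dicts whose values exhaust the codomain; `compose`, `inverse` and the sample maps are hypothetical names.

```python
def compose(g, f):
    """The composite g f (first f, then g) of maps given as dicts."""
    return {x: g[f[x]] for x in f}

def inverse(f):
    """Two-sided inverse of a bijection given as a dict (codomain = set of values)."""
    inv = {y: x for x, y in f.items()}
    if len(inv) != len(f):
        raise ValueError("map is not injective")
    return inv

f1 = {0: "a", 1: "b", 2: "c"}         # X -> Y
f2 = {"a": "q", "b": "r", "c": "p"}   # Y -> Z

lhs = inverse(compose(f2, f1))            # (f2 f1)^(-1) : Z -> X
rhs = compose(inverse(f1), inverse(f2))   # f1^(-1) f2^(-1) : Z -> X
```

The other order, `compose(inverse(f2), inverse(f1))`, is not even defined here, since the inverse of f1 lands in X while the inverse of f2 starts from Z.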

Notation: $f : X \xrightarrow{\cong} Y$ means that f is an isomorphism. We say X is isomorphic
with Y, written $X \cong Y$, if there exists at least one f from X to Y which is an
isomorphism, i.e.

$\exists f\,[f : X \xrightarrow{\cong} Y]$

As discussed previously this definition is used in every category, but in the case 
of the category of abstract sets and arbitrary maps it is often referred to as being 
equinumerous or having the same cardinality, with isomorphic being reserved for
other categories in which naturally some further structure (such as group structure
or topological structure) will be the “same” between isomorphic X, Y. However,
abstract sets should be considered as the case of “zero structure,” and because
we have learned to treat the number zero on an (almost) equal footing with other
numbers, we should simplify terminology and our thinking by treating the category
of sets on an (almost) equal footing with other categories such as are met in
more structured parts of mathematics. That isomorphism of abstract sets gives us
a way to study equinumerosity without counting (hence also for infinite sets) was 
exploited systematically by Cantor. (He was anticipated to some extent by Galileo 
and Steiner.) 

Exercise 3.13 

[Galileo Galilei, 1638] The set of all nonnegative whole numbers is isomorphic
with the set of all square whole numbers because the map $f(x) = x^2$ has (with the
stated domain and codomain) a two-sided inverse g. What is the usual symbol for
this g? ◊

Of course counting will be one of the methods for investigating size (all methods 
are consequences of the definition). Namely, if X is thought of as a “standard” set 
(like {1, 2, 3, 4, 5}), then a specific counting process that counts a set Y is of course
a specific invertible map $f : X \xrightarrow{\cong} Y$. (Note that the same Y can be counted in
different “orders” by the same X, which would involve different maps.) If Z can
also be counted by the same X, we can then conclude that Y and Z are (objectively)
the same size (which is true independently of our subjectively chosen standard X
and of our choices of counting order):
Corollary 3.14: If $f : X \xrightarrow{\cong} Y$ and $g : X \xrightarrow{\cong} Z$, then $Y \cong Z$.

Proof: By Propositions 3.11 and 3.12, $h = g f^{-1}$ shows the required existence.


The observation of Galileo that an infinite set can be isomorphic to a proper part
of itself was turned into a definition by Dedekind:

Definition 3.15: A set X is said to be Dedekind-finite if and only if

$\forall f\,[X \xrightarrow{f} X \text{ and } f \text{ monic} \Rightarrow f \text{ isomorphism}]$

and correspondingly X is said to be Dedekind-infinite if and only if

$\exists f\,[X \xrightarrow{f} X \text{ and } f \text{ monic and } f \text{ not surjective}]$
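For a very small set the first condition can be verified by exhausting all self-maps, and for the natural numbers the successor map suggests a witness for the second. This is only a sketch: `is_dedekind_finite` and `successor` are hypothetical names, and the check on the natural numbers covers a finite sample, so it illustrates rather than proves.

```python
from itertools import product

def is_dedekind_finite(X):
    """Brute force: every injective self-map of the finite list X is surjective."""
    X = list(X)
    for values in product(X, repeat=len(X)):   # values[k] is the image of X[k]
        injective = len(set(values)) == len(X)
        surjective = set(values) == set(X)
        if injective and not surjective:
            return False
    return True

def successor(n: int) -> int:
    """A self-map of the natural numbers that is injective and never takes the value 0."""
    return n + 1

sample = range(1000)
successor_injective_on_sample = len({successor(n) for n in sample}) == len(sample)
zero_is_hit_on_sample = any(successor(n) == 0 for n in sample)
```

The brute-force search grows like n^n, so it is only practical for sets with a handful of elements.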


Exercise 3.16 

Show that X = [0, 1], the unit interval of calculus, is Dedekind-infinite. ◊

Exercise 3.17 

If X is Dedekind-finite, $A \xrightarrow{i} X$ is a part of X and A is equinumerous with X
(i.e., $A \cong X$), then the part is the whole in the sense that i itself is invertible. ◊


3.3 Cartesian Products and Graphs 

There is a common theme involving several different interrelated constructions that 
we will now explore: 


FINITE INVERSE LIMITS (Cartesian operations):

    Terminal object
    Inverse image (of a part along a map)
    Intersection (of parts)
    Equalizers
    Cartesian products
    Pullbacks (= fibered products)

and numerous subsidiary notions like
graphs, diagonal, projections





The French philosopher René Descartes is said to have introduced the idea that
dealing with more than one quantity can be done by introducing higher-dimensional
space. Along with that goes the idea of describing parts of higher-dimensional space
by means of equations. For example, consider the portion (circle) of the plane
where a certain equation is true. The operation of passing from the coordinatized
plane to the portion where the equation $x^2 + y^2 = 1$ is true will be an example of
the equalizer process. We can also intersect figures in higher-dimensional space,
raising questions such as: If two figures are described by equations, then is their
intersection also described by equation(s)? and so on. There is a rich interplay of
all these Cartesian operations, which are also called finite inverse limits.

We will find that Cartesian products and equalizers will be sufficient to construct 
inverse limits, and so it turns out that inverse images can be defined in terms of 
equalizers and Cartesian product. Alternatively, equalizers can be defined in terms 
of inverse images, and thus we will have to understand not only what each of these 
operations is but also how some of them can be defined in terms of others because 
these relationships are part of the lore of how the operations are used. 

Let us consider first a very simple but important example, namely the product
$2 \times 2$, where 2 is a two-element set whose elements we can call 0, 1 for convenience.
It will then turn out that the product $2 \times 2$ has four elements, which can be labeled

    $\langle 0, 1 \rangle$   $\langle 1, 1 \rangle$

    $\langle 0, 0 \rangle$   $\langle 1, 0 \rangle$

by “ordered pairs” of elements of 2. The universal mapping property, which we
will require of products in general, amounts in this case to the following: Suppose
X is any set and suppose that

$X \xrightarrow{\varphi} 2$,   $X \xrightarrow{\psi} 2$

are any two given maps with the same domain X and whose codomains are the
factors of the product $2 \times 2$ under consideration; then, there is a uniquely defined
single map

$X \xrightarrow{\langle \varphi, \psi \rangle} 2 \times 2$

whose domain is still X but whose codomain is the product and whose value at any
element x is the element of $2 \times 2$ labeled by the ordered pair of values determined
by the given $\varphi$, $\psi$.

$\langle \varphi, \psi \rangle x = \langle \varphi x, \psi x \rangle$

The mapping $\langle \varphi, \psi \rangle$ can be understood in terms of the parts of X for which $\varphi$, $\psi$
are the characteristic functions with 1 interpreted as true and 0 as false. If x belongs
to the part characterized by $\varphi$ but does not belong to the part characterized by $\psi$,
then, at such an x, $\langle \varphi, \psi \rangle$ has the value

$\langle \varphi, \psi \rangle x = \langle 1, 0 \rangle$

and similarly for the other three kinds of element x definable in terms of $\varphi$, $\psi$. Now
the mappings

$2 \times 2 \to 2$

are usually called propositional operations or truth tables. For example, the map
$2 \times 2 \xrightarrow{\&} 2$ described by the table


    u    v    u & v
    0    0      0
    0    1      0
    1    0      0
    1    1      1


is usually called “conjunction” and read “and”. Now a composite

$X \xrightarrow{\langle \varphi, \psi \rangle} 2 \times 2 \xrightarrow{\&} 2$

with a propositional operator such as & yields a new property $\varphi \,\&\, \psi$ of (elements
of) X constructed out of the two given properties $\varphi$, $\psi$. A propositional operation
corresponds, via characteristic functions, to an operation on parts of any given X.


Proposition 3.18: If $\varphi$ is the characteristic function of the part i of X and $\psi$ is the
characteristic function of the part j of X, then

$\varphi \,\&\, \psi$ is the characteristic function of $i \cap j$

or, equivalently, if we denote by $\{X \mid \varphi\}$ the part of X with characteristic function
$\varphi$, we have

$\{X \mid \varphi \,\&\, \psi\} = \{X \mid \varphi\} \cap \{X \mid \psi\}$


Exercise 3.19 

Prove the proposition. ◊
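A single instance of Proposition 3.18 can be computed directly; this illustrates the statement and is not a proof. The set X, the two properties `phi` and `psi`, and the helper names below are assumptions made for the example.

```python
X = range(12)

def phi(x):
    """Characteristic function of the even numbers in X."""
    return 1 if x % 2 == 0 else 0

def psi(x):
    """Characteristic function of the multiples of 3 in X."""
    return 1 if x % 3 == 0 else 0

# The truth table of & as a map 2 x 2 -> 2.
AND = {(0, 0): 0, (0, 1): 0, (1, 0): 0, (1, 1): 1}

def pair(p, q):
    """The induced map <p, q> : X -> 2 x 2."""
    return lambda x: (p(x), q(x))

def phi_and_psi(x):
    """The composite of <phi, psi> with &."""
    return AND[pair(phi, psi)(x)]

def part(chi):
    """The part {X | chi} of X with characteristic function chi."""
    return {x for x in X if chi(x) == 1}

left = part(phi_and_psi)
right = part(phi) & part(psi)   # set intersection
```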

The product of any two sets A, B should be thought of as a “bag of dots” P 
that has “the right number of elements” and is moreover equipped with a pair of 
“projection” maps $\pi_A$, $\pi_B$ that “arrange P in rectangular fashion” as follows:

[Figure: the bag of dots P arranged in a rectangle, with the projection $\pi_A$ onto A.]

These features will follow from the definition below for products that directly 
express the most basic act we constantly do in actual use of products. In the category 
of abstract sets and mappings we will use the same definition of product that we 
use in any category. 

Definition 3.20: In any category, a product of two objects A and B is a pair of
given mappings

$A \xleftarrow{\pi_A} P \xrightarrow{\pi_B} B$

such that

$\forall X,\ f : X \to A,\ g : X \to B\ \ \exists! h : X \to P\ [f = \pi_A h \text{ and } g = \pi_B h]$

as in the following commutative diagram:



AXIOM: BINARY PRODUCTS 

Any two sets A, B have a product, $A \xleftarrow{\pi_A} P \xrightarrow{\pi_B} B$.

Usually we write $P = A \times B$, $h = \langle f, g \rangle$ so that the diagram in the axiom is






Exercise 3.21 

Show directly from the axiom that there is a map

$\tau_{A,B} : B \times A \to A \times B$

for which

$\pi_A^{A,B} \tau_{A,B} = \pi_A^{B,A}$
$\pi_B^{A,B} \tau_{A,B} = \pi_B^{B,A}$

Using the uniqueness of the induced map as required in the axiom, show that

$\tau_{A,B} \tau_{B,A} = 1_{A \times B}$

the identity mapping. Caution: In dealing with more than one product, it is often
necessary to use a more precise notation for the projections as we have done in the
description of $\tau$. ◊


Exercise 3.22 

Show that for any A, $1 \times A \cong A$.

Hint: To show one of the equations requires using the uniqueness clause in the
definition of product. ◊


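The uniqueness of the induced maps into a product enables one to calculate their
values at (generalized) elements: If $X \xrightarrow{f} A$, $X \xrightarrow{g} B$, and if $T \xrightarrow{x} X$, consider
the diagram

[Diagram: $T \xrightarrow{x} X \xrightarrow{\langle f, g \rangle} A \times B$ with the projections $\pi_A$, $\pi_B$ to A and B.]

We see that $\langle f, g \rangle x$ has the projections

$\pi_A(\langle f, g \rangle x) = (\pi_A \langle f, g \rangle)x = fx$
$\pi_B(\langle f, g \rangle x) = (\pi_B \langle f, g \rangle)x = gx$

Hence, by uniqueness of maps with specified projections, we obtain

$\langle f, g \rangle x = \langle fx, gx \rangle$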
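For finite sets the universal property of the product, including the uniqueness clause, can be confirmed by exhaustive search. The sketch below assumes the usual model of the product by ordered pairs; the sets A, B, X and the maps f, g are made-up examples.

```python
from itertools import product

A, B, X = ["a0", "a1"], [0, 1, 2], ["x", "y"]
P = list(product(A, B))          # one model of A x B: ordered pairs

def pi_A(p):
    """Projection A x B -> A."""
    return p[0]

def pi_B(p):
    """Projection A x B -> B."""
    return p[1]

f = {"x": "a1", "y": "a0"}       # hypothetical f : X -> A
g = {"x": 2, "y": 2}             # hypothetical g : X -> B

def pairing(f, g):
    """The induced map <f, g> : X -> A x B, computed elementwise as <fx, gx>."""
    return {x: (f[x], g[x]) for x in f}

h = pairing(f, g)

# Uniqueness by brute force: among all maps X -> P, list those with projections f and g.
candidates = [dict(zip(X, values)) for values in product(P, repeat=len(X))]
solutions = [k for k in candidates
             if all(pi_A(k[x]) == f[x] and pi_B(k[x]) == g[x] for x in X)]
```

Here `solutions` is searched among all 36 maps from X to P, so finding only `h` reflects the uniqueness of the induced map in this instance.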

An example apparently less abstract than that of $2 \times 2$ can be obtained by considering
the description f of the flight of a fly trapped in a tin can C (during a time
interval T). If A denotes the points on the base of the can, and if B denotes the
points on the side seam, then there is a map $C \xrightarrow{\pi_A} A$ that assigns to each point of
the space in the can its “shadow” in A, and there is also $C \xrightarrow{\pi_B} B$ assigning to each
point in C the point on the seam at the same height. Then by composition we obtain
the two maps $\pi_A f : T \to A$ and $\pi_B f : T \to B$.

But conversely, if we were given a mapping $T \to A$ from times to A (for example
by a movie film that had been shot with the camera looking up from below through
a glass bottom), and if we were similarly given a temporal record $T \to B$ of
the heights, we could uniquely reconstruct the actual flight $T \xrightarrow{f} C$ as the only
mapping satisfying the two composition equations.


Definition 3.23: If $A \xrightarrow{f} B$ is any mapping, then (relative to a given object configured
as a product of A and B) by the graph of f is meant the mapping $A \xrightarrow{\gamma_f} A \times B$
whose projections are the identity map $1_A$ and f respectively, that is,

$\gamma_f = \langle 1_A, f \rangle$


For example, for any element x of A the value of $\gamma_f$ is $\gamma_f(x) = \langle x, fx \rangle$ in the
product.

Proposition 3.24: The graph of f is a part of $A \times B$. The elements of $A \times B$
which are members of the part $\gamma_f$ are precisely all those whose form $\langle x, y \rangle$ has
the property that $y = fx$.
Proof: Since $\pi_A \gamma_f = 1_A$, we have that $\pi_A$ is a retraction for $\gamma_f$ and hence $\gamma_f$ is
monic and therefore a part of its codomain $A \times B$. If $T \xrightarrow{p} A \times B$ is any element
of $A \times B$, it is necessarily of the form

$p = \langle x, y \rangle$

where $x = \pi_A p$, $y = \pi_B p$. If $y = fx$, then $\gamma_f(x) = \langle x, fx \rangle = \langle x, y \rangle = p$ so that
p is a member of $\gamma_f$ (p factors through $\gamma_f$ by way of x).

Conversely, if p is a member of $\gamma_f$ there must be an x that “proves” this membership
so that $p = \langle x, fx \rangle$. ■


Using the picture of a product as a rectangular arrangement with pairs labeling
its elements, we get a very graphical internal picture of a map that is at least as
important as the cographical internal picture of f.

When we need to calculate explicitly using this idea that the graph is also a map,
we recover the original f just by composing

$f = \pi_B \gamma_f$
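The graph, its membership condition, and the recovery of f by composing with the second projection can all be computed for a small example. The sets A, B and the map f below are assumptions chosen for illustration.

```python
A = [0, 1, 2, 3]
B = ["even", "odd"]
f = {a: B[a % 2] for a in A}            # hypothetical f : A -> B

# The graph of f as a map A -> A x B with projections 1_A and f.
gamma_f = {a: (a, f[a]) for a in A}

def pi_A(p):
    """Projection A x B -> A."""
    return p[0]

def pi_B(p):
    """Projection A x B -> B."""
    return p[1]

def is_member(p):
    """An element p of A x B is a member of the part gamma_f iff some x 'proves' it."""
    return any(gamma_f[x] == p for x in A)

recovered = {a: pi_B(gamma_f[a]) for a in A}   # the composite of gamma_f with pi_B
```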


Proposition 3.25: Any section of the projection $A \times B \xrightarrow{\pi_A} A$ is the graph of a
unique map $A \xrightarrow{f} B$.

Proof: Recall that a section of a mapping $\pi$ is a mapping s for which $\pi s = 1_A$. If
$\pi = \pi_A$ is the projection from a product we can compose a given section s with the
other projection, obtaining $f = \pi_B s : A \to B$.

But then s must be the graph of the f so defined because

$\gamma_f = \langle 1_A, f \rangle$

is the unique map for which the projections are $1_A$ and f, and by construction s
has those two projections. ■

Proposition 3.25 suggests considering that for any map

$E \xrightarrow{\pi} A$

(not necessarily a projection from a product), any section s of $\pi$ could be regarded
as a “generalized map” with domain A but with a variable codomain, that is, the
value of such a “map” at $1 \xrightarrow{x} A$ lies in the fiber $E_x$ of $\pi$ over x, and these fibers
may vary. This point of view is very important in mathematics when, for example,
considering vector functions, fiber bundles, and so forth.

Exercise 3.26 

Show that the fiber (= inverse image of x along $\pi$) of $\pi$ at x “is” B if $\pi =
\pi_A : A \times B \to A$. ◊

In the case of a product with two (or more) factors repeated, we may use a slightly
different way of abbreviating the notation for the projections:

$A \times A \underset{\pi_1}{\overset{\pi_0}{\rightrightarrows}} A$,   $\pi_0 \langle a_0, a_1 \rangle = a_0$,   $\pi_1 \langle a_0, a_1 \rangle = a_1$

Moreover, among the many possible graphs in this case, there is an especially
simple one that nonetheless sometimes needs to be mentioned explicitly, the graph
of the identity map on A, which is often called the diagonal map $\delta_A = \langle 1_A, 1_A \rangle$, so

$A \xrightarrow{\delta_A} A \times A \underset{\pi_1}{\overset{\pi_0}{\rightrightarrows}} A$ satisfies $\pi_k \delta_A = 1_A$ for k = 0, 1

A given map

$A^2 \xrightarrow{\theta} A$

is often called a “binary algebraic operation on A” since, for any two elements
$a_0$, $a_1$ of A, $\theta$ will produce a single element as a result:

$a_0 \,\theta\, a_1 = \theta \langle a_0, a_1 \rangle$

(A frequently used notation is to write the symbol for the operation between the
symbols for the two elements of A, which are the components of the element of $A^2$ to
which the operation is being applied.) Thus, for example & is a binary operation on
A = 2 (the set of truth values), and multiplication $\mathbb{R}^2 \xrightarrow{\cdot} \mathbb{R}$ is a binary operation
on the set $A = \mathbb{R}$ of real numbers.


Exercise 3.27 

What is the composite map 

? 

usually called? ◊


Exercise 3.28 

A binary operation $\theta$ is said to satisfy the “idempotent law” if the composite

$A \xrightarrow{\delta_A} A^2 \xrightarrow{\theta} A$

is the identity $1_A$. Does multiplication of real numbers satisfy the idempotent law?
What about conjunction $2 \times 2 \to 2$ of truth values? ◊


3.4 Equalizers 

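Definition 3.29: Given $A \underset{g}{\overset{f}{\rightrightarrows}} B$, an equalizer for f with g is any map $E \xrightarrow{i} A$
such that

(1) $fi = gi$

(2) $\forall T\ \forall x\ [fx = gx \Rightarrow \exists! \bar{x}\ [x = i\bar{x}]]$

as in the following commutative diagram:

[Diagram: $T \xrightarrow{x} A$ factoring as $T \xrightarrow{\bar{x}} E \xrightarrow{i} A$, followed by the parallel pair $f, g : A \rightrightarrows B$.]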
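In the category of sets an equalizer can be realized as the part of A on which the two maps agree. The sketch below works with finite lists; the particular f, g, the test set T, and the helper `factor` are assumptions for illustration.

```python
A = list(range(-3, 4))

def f(a):
    """Hypothetical f : A -> B."""
    return a * a

def g(a):
    """Hypothetical g : A -> B."""
    return a + 2

E = [a for a in A if f(a) == g(a)]   # the part of A on which f and g agree

def i(e):
    """The inclusion E -> A."""
    return e

def factor(x, T):
    """Given x : T -> A with f x = g x, return xbar : T -> E with x = i xbar."""
    if any(f(x[t]) != g(x[t]) for t in T):
        raise ValueError("x does not equalize f and g")
    return {t: x[t] for t in T}      # every value of x already lies in E

T = ["s", "t", "u"]
x = {"s": 2, "t": -1, "u": 2}
xbar = factor(x, T)
```

Because i is injective, the factorization `xbar` is the only possible one, which matches clause (2) of the definition.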


Exercise 3.30 

Any equalizer i (as in the definition) is a part of A. If i is an equalizer, then for
any x

$x \in i \iff fx = gx$ ◊
Exercise 3.31

Suppose that $E \xrightarrow{i} A$ and $E' \xrightarrow{i'} A$ are equalizers of the parallel pair $A \underset{g}{\overset{f}{\rightrightarrows}} B$.
Show that $E \cong E'$ by an isomorphism compatible with i, i'; we say that i and i'
themselves are isomorphic.

Hint: Use the universal mapping property of equalizers to find candidate inverse
isomorphisms. ◊

Notice that the same proof would also show that any two products of X and Y
are isomorphic. This justifies a common abuse of language that allows us to speak
of the equalizer of f and g or the product of X and Y. That is, any two equalizers
(or products) are “equal” up to isomorphism, and this provides further examples of 
the “equations” among sets that we mentioned at the beginning of Section 3.2. The 
proof that the inverse image of a part along a map is unique up to isomorphism is 
the same as those given above. The proof that the sum is unique up to isomorphism 
is dual, i.e., it is the same proof with the arrows reversed. 

Because of their great importance in mathematics, we might want to assume as 
an axiom that equalizers always exist for any parallel pair f, g of maps. However, 
having assumed the existence of products and inverse images, it turns out that we 
can prove the existence of equalizers - indeed by two different constructions. 


Exercise 3.32 

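The equalizer of the projection maps $B^2 \rightrightarrows B$ is the diagonal $B \xrightarrow{\delta_B} B^2$. Given
any $A \underset{g}{\overset{f}{\rightrightarrows}} B$, we can form the single map $\langle f, g \rangle : A \to B^2$ along which we can
take the inverse image of the part $\delta_B$.

[Diagram: the inverse image square formed by $E \to B$, $i : E \to A$, $\langle f, g \rangle : A \to B^2$, and $\delta_B : B \to B^2$.]

Prove that if i is assumed to have the universal mapping property appropriate to the
particular inverse image, it follows that i also has the universal mapping property
required of the equalizer of f with g. ◊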


The other construction of the equalizer will only use that special case of inverse
image known as intersection. Given $A \underset{g}{\overset{f}{\rightrightarrows}} B$, each of them has a graph,
which is a part of $A \times B$:

[Diagram: the two graphs $\gamma_f, \gamma_g : A \to A \times B$.]


If we intersect these two parts of $A \times B$, we clearly get a part of $A \times B$ that “sits
over” the desired part of A. To establish this fact rigorously, note that both of the
graphs actually have the same domain A; thus, their intersection $E \hookrightarrow A \times B$,
being included ($\subseteq$) in both, participates in a commutative diagram

[Diagram: $i_f, i_g : E \to A$ followed by $\gamma_f, \gamma_g : A \to A \times B$, with $\gamma_f i_f = \gamma_g i_g$ the inclusion of E.]

where $i_f$ and $i_g$ are the “proofs” of the respective inclusions. But $i_f$, $i_g$ are both
maps from E to A, and the unusually special construction actually leads to the
following:



Proposition 3.33: $i_f = i_g$.

Proof: Since $\gamma_f i_f = \gamma_g i_g$,

$\pi_A \gamma_f i_f = \pi_A \gamma_g i_g$

But by half of the definition of graph,

$\pi_A \gamma_f = 1_A = \pi_A \gamma_g$

hence, $i_f = i_g$. ■

Exercise 3.34 

Define $i = i_f = i_g$ as in Proposition 3.33. Show that $fi = gi$ and that indeed i is the
equalizer of f with g. Hence, the equalizer of two maps can always be constructed
whenever we can form their graphs and intersect them. ◊
3.5 Pullbacks

The constructions of terminal object, cartesian product, inverse image, intersection, 
and equalizer are all special cases of one general construction called “limit”. In 
practice there are several other instances that often occur, a frequent one being 
“pullback” or “fibered product”. We will show in Section 3.6 that the existence of
equalizers and finite products suffices to guarantee the existence of all finite limits 
and also that the existence of pullbacks and the terminal object suffices for the 
existence of all finite limits. 


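Definition 3.35: Given any two maps with common codomain $A_0 \xrightarrow{f_0} B \xleftarrow{f_1} A_1$, a
pullback (or fibered product) of them means a pair of maps $\pi_0$, $\pi_1$ with common
domain P that

(1) form a commutative square $f_0 \pi_0 = f_1 \pi_1$

[Diagram: the square with $\pi_0 : P \to A_0$, $\pi_1 : P \to A_1$, $f_0 : A_0 \to B$, $f_1 : A_1 \to B$.]

and, moreover,

(2) are universal among such squares in that,
for any T, $a_0$, $a_1$

$f_0 a_0 = f_1 a_1 \Rightarrow \exists! a\ [a_0 = \pi_0 a \text{ and } a_1 = \pi_1 a]$

[Diagram: $a : T \to P$ together with $a_0 : T \to A_0$, $a_1 : T \to A_1$ and the pullback square.]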

We point out that like products, equalizers, inverse images, and sums, pullbacks 
are unique up to unique isomorphism. The proof is once again that of Exercise 3.31, 
and we will speak of P as the pullback of $f_0$ and $f_1$ without explicitly mentioning
$\pi_0$ and $\pi_1$.

The uniqueness of the induced maps into a pullback enables one to calculate
their values at (generalized) elements: Suppose $X \xrightarrow{a_0} A_0$, $X \xrightarrow{a_1} A_1$ satisfy
$f_0 a_0 = f_1 a_1$, and so they define an arrow to the pullback denoted $X \xrightarrow{\langle a_0, a_1 \rangle} P$. If
$T \xrightarrow{x} X$ is any generalized element of X, then the value $\langle a_0, a_1 \rangle x$ has the projections

$\pi_0(\langle a_0, a_1 \rangle x) = (\pi_0 \langle a_0, a_1 \rangle)x = a_0 x$
$\pi_1(\langle a_0, a_1 \rangle x) = (\pi_1 \langle a_0, a_1 \rangle)x = a_1 x$

Hence, by uniqueness of maps with specified projections, we obtain

$\langle a_0, a_1 \rangle x = \langle a_0 x, a_1 x \rangle$

and hence the latter pair is the generalized element of P, which is the required
value.

Proposition 3.36: If products of two objects and equalizers exist, then pullbacks 
exist. 


Proof: Given data $f_0$, $f_1$ as above, first form the product $A_0 \times A_1$ with projections
now denoted $p_i$ for i = 0, 1. The square

[Diagram: the square with $p_0 : A_0 \times A_1 \to A_0$, $p_1 : A_0 \times A_1 \to A_1$, $f_0 : A_0 \to B$, $f_1 : A_1 \to B$.]

is usually not commutative, but we can form the equalizer $P \xrightarrow{i} A_0 \times A_1$ of the
two composites $f_0 p_0$, $f_1 p_1$, that is the part of the product on which the diagram
becomes commutative. If we then define

$\pi_0 = p_0 i$,   $\pi_1 = p_1 i$

we will have $f_0 \pi_0 = f_1 \pi_1$. If the (generalized) element $\langle a_0, a_1 \rangle$ of $A_0 \times A_1$ satisfies
$f_0 a_0 = f_1 a_1$, then it will actually be a member of i, with a unique “proof” a; but

$\langle a_0, a_1 \rangle = ia \Rightarrow$
$a_0 = p_0 \langle a_0, a_1 \rangle = p_0 i a = \pi_0 a$
$a_1 = p_1 \langle a_0, a_1 \rangle = p_1 i a = \pi_1 a$

as required of a pullback. ■
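The construction in this proof (first the product, then the equalizer of the two composites) can be carried out literally for finite sets. The sets and maps below are invented for the example, and `induced` is a hypothetical helper name.

```python
from itertools import product

A0 = ["apple", "fig", "kiwi"]
A1 = [3, 4, 5, 6]
f0 = {w: len(w) for w in A0}        # hypothetical f0 : A0 -> B (word length)
f1 = {n: min(n, 5) for n in A1}     # hypothetical f1 : A1 -> B (cap at 5)

pairs = list(product(A0, A1))       # the product A0 x A1

def p0(pr):
    """Projection A0 x A1 -> A0."""
    return pr[0]

def p1(pr):
    """Projection A0 x A1 -> A1."""
    return pr[1]

# Equalizer of the two composites f0 p0 and f1 p1 on the product.
P = [pr for pr in pairs if f0[p0(pr)] == f1[p1(pr)]]
pi0 = {pr: p0(pr) for pr in P}      # pi0 = p0 i
pi1 = {pr: p1(pr) for pr in P}      # pi1 = p1 i

def induced(a0, a1, T):
    """For a0 : T -> A0 and a1 : T -> A1 with f0 a0 = f1 a1, the map a : T -> P."""
    a = {t: (a0[t], a1[t]) for t in T}
    if any(a[t] not in P for t in T):
        raise ValueError("the square f0 a0 = f1 a1 does not commute")
    return a

T = [0, 1]
a0 = {0: "fig", 1: "apple"}
a1 = {0: 3, 1: 6}
a = induced(a0, a1, T)
```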

Proposition 3.37: The existence of pullbacks and terminal object implies the exis- 
tence of binary products, inverse images, intersections, and equalizers. 
Proof: If B = 1, then there is only one possibility for $f_0$, $f_1$, and moreover any
square ending in B is commutative. Hence, the diagram

[Diagram: the pullback square of $A_0 \to 1 \leftarrow A_1$.]


shows that product is literally a special case of pullback provided we have a terminal 
object 1 (of course in the category of sets we have a terminal object, namely a one- 
element set). We have previously seen that equalizers can be constructed using 
products and intersections (of graphs) or using products and inverse images (of 
diagonal parts). Further, we saw that intersection may be considered as a special 
case of inverse image. The proposition will be proved if we can show that inverse 
image is a special case of pullback; the interesting aspect, which requires proof, is 
considered in the following exercise. 

Exercise 3.38 

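Suppose that

[Diagram: a commutative square in which j is one of the two maps into the common codomain.]

is a pullback square and also that j is a monomorphism. Then it follows that the map
opposite to j in the square (the pullback of j) is also a monomorphism. ■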


3.6 Inverse Limits 

Now we want to describe the more general concept of limit. The data for a limit
construction involve a family $A_i$ of objects and a family $f_j$ of arrows between them.
Actually, we should specify a data type

$J \underset{c}{\overset{d}{\rightrightarrows}} I$

(or “directed graph”), where J is a set of indices for the arrows $f_j$ in specific data
of this type and I is the set of indices for the objects $A_i$. In data of the given type,
the domains and codomains are required to be given by the maps d, c:

$A_{d(j)} \xrightarrow{f_j} A_{c(j)}$ for all $1 \xrightarrow{j} J$

(We call the data type finite if I and J are finite. We do not require that the $A_i$'s
be finite.)


Definition 3.39: The limit (also called inverse limit) of given data $A_i$, $f_j$ as above
is given by a single object L together with a universal cone; here by a cone with
vertex L and with base the given data is meant a family

$L \xrightarrow{\pi_i} A_i$ for $1 \xrightarrow{i} I$

of maps satisfying

$f_j \pi_{d(j)} = \pi_{c(j)}$ for all $1 \xrightarrow{j} J$

That the above cone is universal means that for any object T and any cone with
vertex T and with the given data as base, i.e. for any family

$T \xrightarrow{a_i} A_i$ for $1 \xrightarrow{i} I$

for which

$f_j a_{d(j)} = a_{c(j)}$ for all $1 \xrightarrow{j} J$

there is a unique

$T \xrightarrow{a} L$

that preserves the cone structure such that

$a_i = \pi_i a$ for all $1 \xrightarrow{i} I$


For example, products of I factors are easily seen to be limits according to the
data type

$0 \rightrightarrows I$

for in the case of products there are no given “bonding maps” $f_j$ and correspondingly
no equations presupposed in the universal property. On the other hand, equalizers
are limits according to the data type

$2 \underset{c}{\overset{d}{\rightrightarrows}} 2$

where d, c are the two distinct constant maps, because any equalizer data

$A \underset{g}{\overset{f}{\rightrightarrows}} B$

involves two given arrows between two given objects according to the scheme that
both arrows have the same domain A (hence constancy of d) but also the same
codomain B (hence constancy of c).


Exercise 3.40 

Specify a pair of mappings $2 \underset{c}{\overset{d}{\rightrightarrows}} 3$ so that limits according to this data type are
pullbacks. ◊


Once again we note that limits are unique up to isomorphism. 


AXIOM: FINITE INVERSE LIMITS 

S has all finite limits. 


This means that there is a limit for any data for any finite data type whatsoever. 
Luckily the general property can be deduced from simpler information. 

Theorem 3.41: In any category, if products and equalizers exist, then limits ac- 
cording to any data type can be constructed. 

To help in understanding the following proof notice that Proposition 3.36 is a 
special case of this theorem and that the construction used in the proof of that 
proposition is generalized here. 


Proof: Given the data type

$J \underset{c}{\overset{d}{\rightrightarrows}} I$

and also given $A_i$, $f_j$ such that

$A_i \xrightarrow{f_j} A_{i'}$ whenever $d(j) = i$, $c(j) = i'$

consider the products $\prod_I A_i$ and also $\prod_J A_{c(j)}$.

Thus, for example, if the data type is

[Figure: a data type with five objects and four arrows, and the graph it expresses.]

then the first product has five factors, namely all the $A_i$'s, but the second product
has four factors that (since c has only two values, etc.) are really just $A_1$ taken twice
and $A_3$ taken twice.

Now between the two products we can construct two maps:

$\prod_I A_i \underset{p}{\overset{f}{\rightrightarrows}} \prod_J A_{c(j)}$

where for all j

$p_j f = f_j p_{d(j)}$

which determines f by determining all of its J coordinates as being (essentially)
all the given bonding maps $f_j$. The other map p is simpler for it does not depend
on the given bonding maps at all:

$p_j p = p_{c(j)}$ for all j


Now if we are given any family of maps

$T \xrightarrow{a_i} A_i$ for $1 \xrightarrow{i} I$

it can be considered as a single map $T \xrightarrow{\langle a_i \rangle_I} \prod_I A_i$. Consider the diagram

$T \xrightarrow{\langle a_i \rangle_I} \prod_I A_i \underset{p}{\overset{f}{\rightrightarrows}} \prod_J A_{c(j)}$

By construction of p and f, $\langle a_i \rangle_I$ will satisfy the equation

$p \langle a_i \rangle_I = f \langle a_i \rangle_I$

if and only if the given family satisfies the family of compatibility equations

$a_{i'} = f_j a_i$ whenever $d(j) = i$, $c(j) = i'$

Hence, if we let

$L \xrightarrow{e} \prod_I A_i$ be an equalizer of p, f

then precisely such “$f_j$ compatible” families will factor across the monic e. That
is, if we define

$\pi_i = p_i e$ for $1 \xrightarrow{i} I$

then $\pi_i$ will have the universal mapping property required of the limit. ■
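For finite diagrams of finite sets, the proof's recipe amounts to listing the elements of the product and keeping the compatible families. The following sketch assumes a particular encoding of the data (dicts keyed by the index sets I and J); the function name `limit` and the sample diagram are hypothetical.

```python
from itertools import product

def limit(objects, arrows):
    """Limit of a finite diagram of finite sets.

    objects: dict i -> list of elements of A_i
    arrows:  dict j -> (d_j, c_j, f_j), where f_j is a dict A_{d_j} -> A_{c_j}
    Returns the compatible families, each as a dict i -> element of A_i.
    """
    index = list(objects)
    families = [dict(zip(index, values))
                for values in product(*(objects[i] for i in index))]
    # Keep the families on which the two maps into the second product agree:
    # coordinate j of p is a_{c(j)}, coordinate j of f is f_j(a_{d(j)}).
    return [a for a in families
            if all(a[c] == fj[a[d]] for (d, c, fj) in arrows.values())]

# Equalizer data type: two objects, two arrows, both from "A" to "B".
objects = {"A": [0, 1, 2, 3], "B": [0, 1]}
arrows = {
    "f": ("A", "B", {0: 0, 1: 1, 2: 0, 3: 1}),
    "g": ("A", "B", {0: 0, 1: 0, 2: 1, 3: 1}),
}
L = limit(objects, arrows)

# With no arrows at all, the limit is just the product.
plain_product = limit({"0": [1, 2], "1": ["u", "v", "w"]}, {})
```

In the equalizer example each compatible family is determined by its A-coordinate, so L corresponds to the part of A where f and g agree.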


Corollary 3.42: If pullbacks and a terminal object exist in a category, then all 
finite limits exist. 


Proof: Proposition 3.37. ■ 

Exercise 3.43 

A terminal object is an inverse limit. 

Hint: The data type for the limit has both I and J empty. ◊

Exercise 3.44 

For any given single map $X \xrightarrow{f} Y$, the pullback of f along itself is often called
“the equivalence relation associated to f”. This pullback involves a pair of maps
$K \rightrightarrows X$ and hence a single map e to $X \times X$. For any parallel pair $T \underset{x_2}{\overset{x_1}{\rightrightarrows}} X$,
$f x_1 = f x_2$ iff $\langle x_1, x_2 \rangle$ belongs to e. (See also Proposition 4.8 and Exercises 4.22
and 4.23.) ◊


3.7 Additional Exercises 

Exercise 3.45 

Show that the category of finite-dimensional vector spaces has all finite limits. 
Hint: They are computed using the finite limits in S.





Exercise 3.46 

(a) Show that in the category of finite-dimensional vector spaces the sum and the
product of two vector spaces $V$, $V'$ are the same vector space that we usually
denote $V \oplus V'$. It has both projections and injections.

This means that if $V$ is a vector space there is both a diagonal linear transformation
$V \to V \oplus V$ and a codiagonal linear transformation $V \oplus V \to V$. Therefore
we can define a binary operation on linear transformations $T, T' : V \to V'$, whose
result is the threefold composite

$$T + T' = V \longrightarrow V \oplus V \longrightarrow V' \oplus V' \longrightarrow V'$$

The middle arrow is called $T \oplus T'$.

(b) Show how to define $T \oplus T'$, that $+$ coincides with the sum of linear
transformations met in linear algebra, and that $+$ makes the linear transformations from
$V$ to $V'$ into a group.

Exercise 3.47 

Show that the category of groups has all finite limits. 

Exercise 3.48 

(a) Show that the categories S/X have all finite limits. 

Hint: They are computed “fiberwise”.

More generally, 

(b) for any category C that has all finite limits, show that C / X has all finite limits 
for any X in C. 

Exercise 3.49 

Show that the category of pointed sets 1/S has all finite limits.

Exercise 3.50 

Sets with action: Given a set $A$ (frequently referred to as an “alphabet”), by an
“action” of $A$ on a set $S$ is meant any mapping $S \times A \xrightarrow{\ \delta\ } S$.

(a) Show that sets equipped with given $A$-actions form a category if we take
as arrows from $(S, \delta)$ to $(S', \delta')$ those mappings $S \xrightarrow{\ \varphi\ } S'$ such that
$\varphi\delta = \delta'(\varphi \times 1_A)$. (A special case where $A$ has two elements was considered in
Exercise 1.30 (d).)

When $\delta$ is given, it is convenient to abbreviate $\delta(s, a)$ simply by $sa$ and call
it the action of $a$ on $s$.

(b) Show that the category of A-sets has sums and finite limits. 

Exercise 3.51

M-sets: A monoid is a set $M$ with an associative binary operation (that is,
$m(m'm'') = (mm')m''$ for any three elements) and an element 1 which is an identity (that is,
$1m = m = m1$ for any $m$). If $M$ is a monoid and $X$ is a set, a (right) action of $M$
on $X$ is a mapping $X \times M \xrightarrow{\ \delta\ } X$ such that $x1 = x$ and $(xm)m' = x(mm')$ (where
we are writing $xm$ for $\delta(x, m)$). A set $X$ equipped with an action of $M$ is called
an $M$-set. A morphism of $M$-sets from $X$ to $X'$ is a mapping $X \to X'$ that is
equivariant (i.e., it satisfies an equation like that above for $A$-sets).

(a) Show that M-sets form a category. 

(b) Show that the category of M-sets has sums and finite limits. 

Actually the category of A-sets is a special example of a category of M-sets, 
where M is the special free monoid 

$$M = A^* = 1 + A + A \times A + A \times A \times A + \cdots$$

What should the infinite “sum” mean? 
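
One way to see the passage from $A$-sets to $A^*$-sets concretely is to model a word as a Python string and let it act one letter at a time. This is a sketch under our own assumptions: the state set, the alphabet `{"+", "r"}`, and the names `extend`, `delta`, and `act` are hypothetical and chosen only for illustration.

```python
from functools import reduce

def extend(delta):
    """Extend an A-action delta(s, a) to an action of words: s(a1 a2 ... an)."""
    def act(s, word):
        # Apply the letters of the word from left to right, starting at s.
        return reduce(delta, word, s)
    return act

# Hypothetical example: states 0, 1, 2 and the two-letter alphabet {"+", "r"}.
def delta(s, a):
    return (s + 1) % 3 if a == "+" else 0

act = extend(delta)
```

The empty word plays the role of the identity 1 and concatenation of strings plays the role of the monoid multiplication, so the two action laws read `act(s, "") == s` and `act(act(s, u), v) == act(s, u + v)`.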

Exercise 3.52 

If X is the category formed from a partially ordered set X, show that a product of 
two objects is exactly the same thing as a greatest lower bound. 
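
As a small illustration (not a solution of the exercise), the universal property can be tested by brute force in a finite poset. The sketch below assumes the divisors of 60 ordered by divisibility; the names `is_product`, `divisors`, and `divides` are ours.

```python
def is_product(p, a, b, elements, leq):
    """Check the universal property of a product of a and b in a finite poset."""
    # The two projections must exist: p <= a and p <= b.
    if not (leq(p, a) and leq(p, b)):
        return False
    # Every x with arrows to a and to b must have an arrow to p; uniqueness is
    # automatic because a poset has at most one arrow between two objects.
    return all(leq(x, p) for x in elements if leq(x, a) and leq(x, b))

divisors = [d for d in range(1, 61) if 60 % d == 0]
divides = lambda x, y: y % x == 0
products = [p for p in divisors if is_product(p, 12, 30, divisors, divides)]
```

In this ordering the only object passing the test for 12 and 30 is their greatest common divisor.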

Colimits, Epimorphisms, and the Axiom of Choice


4.1 Colimits are Dual to Limits 

Each notion of limit has a corresponding “dual” notion of colimit whose universal 
mapping property is obtained by reversing all arrows. Thus, for example, the termi- 
nal object 1 has the limit property that every object X maps uniquely to 1, whereas 
the coterminal (or initial) object 0 has the dual property that every object Y has a 
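unique map from 0. The limit concept of product, i.e. a span (= pair of arrows
with common domain) $A \leftarrow A \times B \rightarrow B$ such that spans $A \xleftarrow{\ f\ } X \xrightarrow{\ g\ } B$ as
below uniquely determine the $\langle f, g \rangle$ in

[Diagram: $X \xrightarrow{\langle f, g \rangle} A \times B$, with the projections to $A$ and $B$ and the maps $f$, $g$]

(which expresses the universal mapping property of projections), dualizes to the
concept of coproduct, i.e. a cospan of injections $A \rightarrow A + B \leftarrow B$ such that
cospans $A \xrightarrow{\ f\ } Y \xleftarrow{\ g\ } B$ as below uniquely determine the $\binom{f}{g}$ in

[Diagram: $A + B \xrightarrow{\binom{f}{g}} Y$, with the injections from $A$ and $B$ and the maps $f$, $g$]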

The universal mapping property for injections to the coproduct should be recognized 
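immediately as that for the injections to the sum. Similarly, the concept of equalizer
for $f$, $g$, namely a (necessarily mono!) arrow $E \to A$ for which arrows $X \xrightarrow{\ x\ } A$
“equalizing” $f$ and $g$ ($fx = gx$) uniquely determine the $X \xrightarrow{\ \bar{x}\ } E$, as in

[Diagram: $E \to A \rightrightarrows B$ (the parallel pair $f$, $g$), with $X \xrightarrow{x} A$ factoring through $E$ via $\bar{x}$]

dualizes to the concept of coequalizer for $f$, $g$, namely an arrow $B \to Q$ for which
arrows $B \xrightarrow{\ y\ } Y$ “coequalizing” $f$ and $g$ (i.e., $yf = yg$) uniquely determine the
$Q \xrightarrow{\ \bar{y}\ } Y$ in

[Diagram: $A \rightrightarrows B \to Q$ (the parallel pair $f$, $g$), with $B \xrightarrow{y} Y$ factoring through $Q$ via $\bar{y}$]

which we will study in more detail later.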
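
For finite sets a coequalizer can be computed by merging $f(a)$ with $g(a)$ for every $a$. The sketch below assumes maps are stored as dicts and uses a simple union-find; the names `coequalizer` and `factor` are ours, and each class is represented by one of its members.

```python
def coequalizer(A, B, f, g):
    """Coequalizer q : B -> Q of f, g : A -> B, for finite sets (maps as dicts)."""
    parent = {b: b for b in B}

    def find(b):
        # Follow parent links up to the representative of b's class.
        while parent[b] != b:
            b = parent[b]
        return b

    for a in A:
        rf, rg = find(f[a]), find(g[a])
        if rf != rg:
            parent[rf] = rg          # force q(f(a)) == q(g(a))
    return {b: find(b) for b in B}

def factor(q, y):
    """Given y with y f = y g, return ybar : Q -> Y with ybar q = y."""
    return {q[b]: y[b] for b in q}

A, B = [0, 1], [0, 1, 2, 3]
f, g = {0: 0, 1: 2}, {0: 1, 1: 3}
q = coequalizer(A, B, f, g)
y = {0: "u", 1: "u", 2: "v", 3: "v"}   # y coequalizes f and g
ybar = factor(q, y)
```

Here `q` identifies 0 with 1 and 2 with 3, and `ybar` is the induced map on the two resulting classes.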

Exercise 4.1 

Formulate the notion of “pushout,” the colimit concept dual to pullback. ◊

Exercise 4.2 

Formulate the general notion of the colimit of a data type 


Exercise 4.3 

Prove that the existence of pushouts and an initial object implies the existence of 
coproducts. 

Hint: Use the dual of Proposition 3.37. ◊

Exercise 4.4 

Find sufficient conditions to guarantee the existence of all finite colimits. Prove that 
the conditions are (necessary and) sufficient. 


Hint: Use the dual of Proposition 3.41. ◊

We require the axiom that the category of abstract sets and mappings has all finite
colimits:

AXIOM: FINITE COLIMITS IN S

S has all finite colimits. 

This axiom is dual to the axiom of finite limits. It includes the axiom of the empty 
set and guarantees that finite sums, coequalizers, and pushouts all exist in S. 

4.2 Epimorphisms and Split Surjections 

Although seemingly not a limit notion, the concept of monomorphism also has its
dual, that of epimorphism:

Definition 4.5: An epimorphism $f$ is a map that has the right-cancellation
property $\forall \varphi, \psi\ [\varphi f = \psi f \Rightarrow \varphi = \psi]$.

We will use the term epimapping (or epimap) interchangeably with the term 
epimorphism for mappings in S and also use the term epic as an adjective. 

Exercise 4.6 

Any coequalizer is an epimorphism. 

Hint: Recall the dual proof! ◊

Exercise 4.7 

Recall the definition of cograph from Section 2.1. Show that the cograph is an
epimorphism. Show that any retraction of the injection $B \to A + B$ is the cograph
of a unique mapping from $A$ to $B$. (These results are dual to Propositions 3.24 and
3.25.) ◊

We will study below the relationship between epic and “surjective” (which in
form is not dual to “injective,” although it does turn out to be equivalent to the dual
in very particular categories such as S).

There is actually an important link between cancellation properties and limits. 

Proposition 4.8: A map $i$ is a monomorphism if and only if when we form the
pullback of $i$ with itself the two projections are equal.

Proof: Suppose the square below is a pullback and moreover $\pi_0 = \pi_1 = \pi$.

$$\begin{array}{ccc}
P & \xrightarrow{\ \pi_1\ } & A \\
{\scriptstyle \pi_0}\downarrow & & \downarrow{\scriptstyle i} \\
A & \xrightarrow{\ i\ } & B
\end{array}$$

If $i x_0 = i x_1$, then by the universal property of pullbacks, there is a unique $T \xrightarrow{\ x\ } P$
for which both $x_0 = \pi_0 x$, $x_1 = \pi_1 x$. But since we have assumed $\pi_0 = \pi_1$, this yields
$x_0 = x_1$; i.e. $i$ has left-cancellation.

Conversely, assume $i$ has cancellation and that $\pi_0$, $\pi_1$ form a pullback of $i$ with $i$.
Then in particular $i\pi_0 = i\pi_1$, hence by cancellation $\pi_0 = \pi_1$. ■
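
In the category of finite sets the criterion of Proposition 4.8 can be checked directly. The following sketch assumes a set is a list without repetitions and a map is a dict; the function names and the two sample maps `inj` and `non` are ours.

```python
def kernel_pair(X, i):
    """Pullback of i with itself: all pairs (x0, x1) with i(x0) == i(x1)."""
    return [(x0, x1) for x0 in X for x1 in X if i[x0] == i[x1]]

def projections_equal(X, i):
    """True when the two projections of the pullback agree on every pair."""
    return all(x0 == x1 for (x0, x1) in kernel_pair(X, i))

def is_injective(X, i):
    """Elementwise injectivity, for comparison with the pullback criterion."""
    return len({i[x] for x in X}) == len(X)

X = [0, 1, 2]
inj = {0: "a", 1: "b", 2: "c"}
non = {0: "a", 1: "a", 2: "c"}
```

For `inj` the pullback contains only diagonal pairs; for `non` it also contains `(0, 1)` and `(1, 0)`, on which the projections differ.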

Exercise 4.9 

Without considering the internal picture, show that a map is epimorphic iff two
normally different maps in the pushout of the map with itself are equal. ◊

Using simple properties of the sets 1 and 2, one can show that in the category of 
abstract sets and mappings: 

Proposition 4.10: A mapping $X \xrightarrow{\ p\ } Y$ is an epimapping iff it is surjective.

Proof: Assume $p$ is epic, that is, it satisfies the right-cancellation law

$$\psi_0 p = \psi_1 p \;\Rightarrow\; \psi_0 = \psi_1 \qquad\qquad X \xrightarrow{\ p\ } Y \underset{\psi_1}{\overset{\psi_0}{\rightrightarrows}} V$$

for all $V$. We want to show that $p$ is surjective, that is

$$\forall y\ \exists x\ [px = y] \qquad\qquad 1 \xrightarrow{\ x\ } X \xrightarrow{\ p\ } Y, \quad 1 \xrightarrow{\ y\ } Y$$

where $y$ is any element of $Y$ with domain 1. To arrive at a proof by contradiction,
suppose that contrary to the desired conclusion there is at least one $y_0$ for which
there is no corresponding $x$, i.e. $px \neq y_0$ for all $x$. Take $V = 2$ and let $\psi_1$ be the
characteristic function of (the one-element part) $y_0$, but let $\psi_0$ be the “constantly
false” map $Y \to 2$. Then for all elements $x : 1 \to X$ of $X$

$$\begin{aligned}
(\psi_1 p)(x) &= \psi_1(px) \\
 &= \text{false} && \text{since } px \neq y_0 \text{ and } \psi_1(y) = \text{true iff } y = y_0 \\
 &= \psi_0(px) \\
 &= (\psi_0 p)x
\end{aligned}$$

hence, $\psi_1 p = \psi_0 p$ since 1 is a separator. But then $\psi_1 = \psi_0$ since we assumed $p$
epic. However, by the way we defined $\psi_0$ and $\psi_1$, $\psi_1 y_0 \neq \psi_0 y_0$, and so we reach
a contradiction, showing that there is no such $y_0$ (i.e., $p$ is surjective).

For the converse part of the proposition, assume that $p$ is surjective and try to
show the right-cancellation property; thus, suppose $\psi_1 p = \psi_0 p$, where $V$ is now
arbitrary, as are $\psi_0$, $\psi_1$. We will use the fact that 1 is a separator. Let $y$ be an arbitrary
element of $Y$. By surjectivity of $p$ there is an element $x : 1 \to X$ such that $y = px$,
and so $\psi_0 y = \psi_0(px) = (\psi_0 p)x = (\psi_1 p)x = \psi_1(px) = \psi_1 y$.

Hence, $\psi_0 = \psi_1$ since 1 separates. Because this cancellation holds for all $V$, we
have proved that $p$ satisfies right-cancellation, i.e. it is epic. ■
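
The first half of the proof is constructive enough to run on finite sets. In the sketch below (our own naming; the truth values of 2 are modelled by Python's `True` and `False`), a value missed by $p$ yields two different maps into 2 that $p$ cannot tell apart.

```python
def separating_pair(X, Y, p):
    """If p misses some y0, return (psi0, psi1) with psi0 p == psi1 p but psi0 != psi1."""
    image = {p[x] for x in X}
    missed = [y for y in Y if y not in image]
    if not missed:
        return None                      # p is surjective, so no such pair exists
    y0 = missed[0]
    psi1 = {y: (y == y0) for y in Y}     # characteristic function of y0
    psi0 = {y: False for y in Y}         # the "constantly false" map
    return psi0, psi1

X, Y = [0, 1], ["a", "b", "c"]
p = {0: "a", 1: "b"}                     # misses "c"
psi0, psi1 = separating_pair(X, Y, p)
```

The two maps agree after composing with `p` yet differ at `"c"`, so this `p` fails right-cancellation.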

If we strengthen the notion of surjectivity to demand the extreme case of existence 
of prevalues for all generalized elements, not only elements with domain 1, we 
obtain the strong condition of split surjectivity. 

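Definition 4.11: An arrow $X \xrightarrow{\ p\ } Y$ is split surjective if and only if

$$\forall T\ \forall y\ \exists x\ [px = y] \qquad\qquad T \xrightarrow{\ x\ } X \xrightarrow{\ p\ } Y, \quad T \xrightarrow{\ y\ } Y$$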

One often expresses this property of p by saying that every y has a lift x. To see 
why, redraw the triangle with p vertical as is often done for continuous mappings 
in topology. 

Proposition 4.12: An arrow $X \xrightarrow{\ p\ } Y$ is split surjective iff there exists a section
for $p$.

Proof: If $p$ is split surjective, let $T = Y$ and $y = 1_Y$, the identity mapping of $Y$.
Then, as a very special case of the split surjectivity, there exists $x = s$ such that

$$ps = 1_Y$$

In other words, $p$ has a section $s$ (usually there will be many).

Conversely, suppose $p$ has a section $s$; then we are to show that $p$ is split surjective.
So consider any $T$ and any $T \xrightarrow{\ y\ } Y$. There is an obvious way to attempt to construct
the $x$ we need: try $x = sy$

$$T \xrightarrow{\ y\ } Y \xrightarrow{\ s\ } X \xrightarrow{\ p\ } Y$$

We then have to show that the $x$ so constructed really maps back to $y$ under $p$, as
required for split surjectivity:

$$\begin{aligned}
px &= p(sy) && \text{by definition of } x \\
   &= (ps)y && \text{by associativity} \\
   &= 1_Y y && \text{since } s \text{ was a section of } p \\
   &= y
\end{aligned}$$

Thus $p$ is split surjective if it has a section, and hence the proof of the proposition
is complete. ■
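
The second half of the proof is a one-line computation, which the following sketch replays on a small example. The particular sets, the element names `"t0"`, `"t1"`, `"t2"`, and the helper `compose` are assumptions made for illustration only.

```python
def compose(g, f):
    """The composite g f (first f, then g), for maps stored as dicts."""
    return {t: g[f[t]] for t in f}

p = {0: "a", 1: "a", 2: "b"}             # X = {0, 1, 2} -> Y = {"a", "b"}
s = {"a": 1, "b": 2}                     # a section: p s = 1_Y
y = {"t0": "b", "t1": "a", "t2": "a"}    # a generalized element T -> Y
x = compose(s, y)                        # the lift x = s y
```

Composing back, `compose(p, x)` returns `y` again, exactly as in the displayed calculation.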

Since split surjectivity trivially implies surjectivity (just take the special case 
T = 1), we have, by means of the logical equivalences in the last two propositions, 
actually already proved the following proposition. However, we will also give a 
direct proof in the hope of further clarifying the relationship between the concepts. 
Note that the proof that follows is valid in any category. 

Proposition 4.13: Any mapping p that has a section is an epimorphism. 

Proof: Let s be a section for p. We must show that p has the right-cancellation 
property; thus, assume 

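$$\psi_0 p = \psi_1 p \qquad\qquad X \xrightarrow{\ p\ } Y \underset{\psi_1}{\overset{\psi_0}{\rightrightarrows}} V$$

where $V$ is arbitrary. But then

$$\begin{aligned}
(\psi_0 p)s &= (\psi_1 p)s \\
\psi_0 (ps) &= \psi_1 (ps) \\
\psi_0 &= \psi_1 && \text{since } ps = 1_Y
\end{aligned}$$

as was to be shown.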

4.3 The Axiom of Choice

The converse of the last proposition, namely that there are no epimappings except
for those that actually have sections $s$, is not at all obvious. Once we have a section in
hand, cancellation becomes the result of a concrete process of calculation, as in the
preceding proof, unlike the abstract leap involved in just “doing the cancellation”.
On the other hand, where the section comes from may also be mysterious. The
usual point of view is that the converse is true for constant abstract sets (and is in
fact a strong testimony to the “arbitrariness” of the maps in that category), whereas
it is obviously false for sets that are less abstract, less constant, or both (as we will
soon see). The name of the converse is

AXIOM: THE AXIOM OF CHOICE 

Every surjective mapping is split surjective (i.e., every epimap has a section). 
Much ink has been spilled over this axiom. 

Proposition 4.14: Any section s for a map p is a single procedure that 
simultaneously chooses an element from each fiber of p 

where the important notion of fiber is given by 

Definition 4.15: For each $X \xrightarrow{\ p\ } Y$ and $1 \xrightarrow{\ y\ } Y$, the fiber of $p$ over $y$ is the
domain of the inverse image of the singleton (= one-element) part $y$ along $p$

$$\begin{array}{ccc}
p^{-1}[y] & \longrightarrow & 1 \\
\downarrow & & \downarrow{\scriptstyle y} \\
X & \xrightarrow{\ p\ } & Y
\end{array}$$

Proof: (of Proposition 4.14.) Since $ps = 1_Y$, $p(sy) = y$, i.e. $sy \in p^{-1}[y]$; in other
words $sy$ is a member of the fiber of $p$ over $y$, but this is so for all $y$. ■

Now (as further axioms of higher set theory reinforce) any reasonable family of
sets parameterized by a set $Y$ can be realized as the family of fibers of some single
map $p$ with codomain $Y$. The sets in the family are all nonempty if and only if $p$ is
surjective (see Exercise 4.16). Hence, the axiom of choice, “Every surjective map
is split surjective,” says that any family of nonempty sets has at least one choice
map that chooses an element from each set in the family.
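
For finite sets no axiom is needed, since a choice can be made by inspection, but the shape of the statement is still visible. The sketch below is ours (the names `fibers` and `choose_section` are illustrative): it computes the family of fibers of a map and picks the first listed element of each.

```python
def fibers(X, Y, p):
    """The family of fibers of p : X -> Y, indexed by the elements of Y."""
    family = {y: [] for y in Y}
    for x in X:
        family[p[x]].append(x)
    return family

def choose_section(X, Y, p):
    """Pick one element from each fiber; impossible exactly when some fiber is empty."""
    family = fibers(X, Y, p)
    if any(not cell for cell in family.values()):
        return None
    return {y: cell[0] for y, cell in family.items()}

X, Y = [0, 1, 2, 3], ["a", "b"]
p = {0: "a", 1: "a", 2: "b", 3: "a"}
s = choose_section(X, Y, p)
```

Any other selection of one element per fiber would serve equally well as a section, which is why sections are usually far from unique.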

Exercise 4.16 

Show that the mapping $p$ is surjective iff each of its fibers has at least one element. ◊

Exercise 4.17

Show that the mapping $p$ is injective iff each of its fibers has at most one element. ◊

Thus, a mapping is bijective if and only if each of its fibers has exactly one 
element. 

Exercise 4.18 

Show that the mapping from real numbers to real numbers defined by the formula 

$$f(x) = x^2 (x - 1)$$

is surjective and hence has sections in the category of abstract sets and arbitrary
mappings. Show, however, that none of these sections are in the continuous category
(see Exercise 3.5) because each one must have a “jump”. ◊

As we will see in Appendix B, an equivalent form of the axiom of choice, often 
used in proofs in analysis and algebra, is 

The Maximal Principle of Zorn 

4.4 Partitions and Equivalence Relations 

The dual notion (obtained by reversing the arrows) of “part” is the notion of
partition.

Definition 4.19: A partition of A is any surjective mapping p with domain A. The 
fibers of p are called the cells of the partition. 

Proposition 4.20: Let $A \xrightarrow{\ p\ } I$ be surjective, and for each $1 \xrightarrow{\ i\ } I$ let $A_i$ be the
cell of $p$ over $i$. Then all the $A_i$ are nonempty parts of $A$; every element of $A$ is in
exactly one of the $A_i$.

By way of explanation, note that every (not necessarily surjective) mapping with
domain $A$ gives rise to a partition of $A$, as described in the proposition, by restricting
consideration to those elements of the codomain at which the fiber is nonempty. In
other words, a general mapping specifies more information than just the partition
of its domain in that it specifies also the size of the part of the codomain where the
fibers are empty. By restricting to the case where the latter part is empty (i.e., to the
case of surjective mappings), we are in the case of mappings that specify no more
(and no less) than a partition of the domain. This “justifies” Definition 4.19.

(A similar discussion could have been given for the dual concept “part of $B$”. That
is, any mapping with codomain $B$ gives rise to a part of $B$, namely the part whose
members are just those elements of $B$ that actually are values of the given mapping.
However, a general (not-necessarily-injective) mapping specifies much more
information than just this “image” part since the members of the image will have some
“multiplicity,” i.e. will be values of the map at more than one element of the domain
(multiplicity = size of fiber). Injective mappings (where all nonempty fibers have
multiplicity exactly one) are just parts, without any additional information.)

Here is a typical cograph picture of a partition of a set $A$ into three parts:

[Figure: cograph picture of a partition of $A$ into three cells; not recoverable from the source]

In terms of partitions, the axiom of choice can be rephrased by saying that for any
partition $p$ of any set, there exists at least one choice function $s$ that chooses an
element from each cell of the partition

$$A \underset{s}{\overset{p}{\rightleftarrows}} I, \qquad ps = 1_I, \qquad s(i) \in A_i$$

for all $1 \xrightarrow{\ i\ } I$, where $A_i$ is the fiber of $p$ at $i$.

The axiom of choice seems quite plausible because we are allowing “arbitrary” 
mappings s. In categories in which only geometrically “reasonable” mappings are 
allowed, the axiom of choice is usually not true; but this only points out that such 
categories are distinct from the category of constant sets and arbitrary maps, which
itself exists as an especially simple extreme case with which to contrast the others
(Cantor’s abstraction). Some of the opposition to the axiom of choice stems from
its consequence, the Banach-Tarski Paradox, which states that a solid ball can
be shattered into five parts of such an unreasonable “arbitrary” nature that they
can be reassembled into two solid balls of the same size as the original. The
abstractness of the sets, correlated with the arbitrary nature of the mappings, makes
such paradoxes possible, but of course such paradoxes are not possible in the real
world where things have variation and cohesion and mappings are not arbitrary.
Nonetheless, since Cantor, mathematicians have found the constant noncohesive
sets useful as an extreme case with which continuously variable sets can be
contrasted.

Closely related to the notion of partition is the concept of equivalence relation. 

Definition 4.21: A relation $R$ from $X$ to $Y$ is a part $R \xrightarrow{\langle p_0, p_1 \rangle} X \times Y$ of the product
of $X$ and $Y$. If $X = Y$, we speak of a relation on $X$. The opposite of a binary relation
$R$ is the binary relation from $Y$ to $X$ with projections $p_1$ and $p_0$. It is denoted $R^{op}$.

Notice that there is an abuse of language in omitting the projections involved in
defining the binary relation $R$; and, as a mapping, $R^{op}$ has the same domain as $R$!


Exercise 4.22

$R^{op}$ actually is a part of $Y \times X$.

Hint: Use the “twist” map $\tau_{X,Y}$. ◊

Suppose that $A \xrightarrow{\ p\ } I$ is a partition, and let the following diagram be a pullback:

$$\begin{array}{ccc}
R_p & \xrightarrow{\ p_1\ } & A \\
{\scriptstyle p_0}\downarrow & & \downarrow{\scriptstyle p} \\
A & \xrightarrow{\ p\ } & I
\end{array}$$

Exercise 4.23

The resulting arrow $R_p \xrightarrow{\langle p_0, p_1 \rangle} A \times A$ is a monomorphism. ◊

Thus, $R_p$ is a relation on $A$. It has some special properties we are about to explore,
but notice that the proof that $\langle p_0, p_1 \rangle$ is monic does not depend on the fact that $p$ is
epi. We will need that only when we come to the proof of Proposition 4.32 below.
With $A$ as test object and the identity mappings to the vertices of the pullback above
we obtain a mapping $r : A \to R_p$:

[Diagram: $r : A \to R_p$ induced by the pair of identity maps $1_A$, $1_A$ into the pullback]

The mapping $r$ satisfies $p_0 r = 1_A = p_1 r$. In general,

Definition 4.24: A relation $R \xrightarrow{\langle p_0, p_1 \rangle} X \times X$ on $X$ is called reflexive iff there is a
mapping $r : X \to R$ such that $p_0 r = 1_X = p_1 r$.

Exercise 4.25 

Recall the diagonal map $X \to X \times X$. A relation $R$ is reflexive if and only if it
contains the diagonal part, or if and only if for any (generalized) element $x$ of $X$
we have $\langle x, x \rangle$ is a member of $R$. ◊

Combining the fact that $\langle p_0, p_1 \rangle$ is monic with uniqueness of mappings to
products, we see that for a reflexive relation, $r$ is unique. Thus, a relation either is or is not
reflexive. In Section 10.4 we will study reflexive graphs determined by two sets $E$
and $V$ and three mappings: $s, t : E \to V$ and $r : V \to E$ such that $sr = 1_V = tr$.
In that case $r$ is part of the reflexive graph structure and need not be unique because
$\langle s, t \rangle$ need not be mono.

The relation $R_p$ has two additional important properties we now define.

Definition 4.26: A relation $R \xrightarrow{\langle p_0, p_1 \rangle} X \times X$ on $X$ is called symmetric iff

$$R^{op} \subseteq R$$

Exercise 4.27

A relation $R$ is symmetric if and only if for any (generalized) element $\langle x, y \rangle$ of
$X \times X$ we have

$$\langle x, y \rangle \in R \;\Rightarrow\; \langle y, x \rangle \in R$$

if and only if the restriction of the twist mapping $\tau_{X,X}$ to $R$ is contained in $R$. ◊


Exercise 4.28

The relation $R_p$ is symmetric. ◊

A third important property of $R_p$ is the following:

Definition 4.29: A relation $R$ is called transitive if and only if for any (generalized)
elements $\langle x, y \rangle$ and $\langle y, z \rangle$ of $X \times X$ we have

$$\langle x, y \rangle \in R \ \&\ \langle y, z \rangle \in R \;\Rightarrow\; \langle x, z \rangle \in R$$


Exercise 4.30

The relation $R_p$ associated to a map $A \xrightarrow{\ p\ } I$ is transitive. ◊

Combining these properties of $R_p$, we make the following definition:

Definition 4.31: A relation $R$ on $X$ is called an equivalence relation iff it is
reflexive, symmetric, and transitive.

Now if we start with an equivalence relation $R$ on $X$, we may form the coequalizer
of $R \rightrightarrows X$, which we denote by $X \xrightarrow{\ p_R\ } P_R$. This is a partition of $X$. (We obtain a
partition of $X$ by forming the coequalizer of any two mappings with codomain
$X$; it is the special properties of equivalence relations that allow the next result.)
Taking the equivalence relation of a partition of $X$ and taking the partition from an
equivalence relation on $X$ are inverse processes:

Proposition 4.32: If $p$ is a partition of $X$, then $p = p_{R_p}$. If $R$ is an equivalence
relation on $X$ then $R = R_{p_R}$.

Exercise 4.33

Prove Proposition 4.32. ◊
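
The two processes can be tried out on a finite example. In the sketch below (an illustration, not a proof) a relation is a Python set of pairs, the cell of $x$ is the frozenset of elements related to $x$, and two partitions are compared up to renaming of cells by comparing their relations. All function names are ours.

```python
def relation_of(X, p):
    """R_p: the pullback of p with itself, as a set of pairs."""
    return {(x, y) for x in X for y in X if p[x] == p[y]}

def partition_of(X, R):
    """p_R for an equivalence relation R: send each element to its cell."""
    return {x: frozenset(y for y in X if (x, y) in R) for x in X}

def is_equivalence(X, R):
    reflexive = all((x, x) in R for x in X)
    symmetric = all((y, x) in R for (x, y) in R)
    transitive = all((x, w) in R for (x, y) in R for (z, w) in R if y == z)
    return reflexive and symmetric and transitive

X = list(range(6))
p = {x: x % 3 for x in X}        # a partition of X into three cells
R = relation_of(X, p)
p_R = partition_of(X, R)
```

Note that `partition_of` describes the coequalizer only when `R` is an equivalence relation; for an arbitrary pair of maps one would have to generate the equivalence relation first.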


4.5 Split Images 

There is a version of the choice axiom that does not assume the given map is 
surjective and correspondingly produces another map that is in general somewhat 
less than a section. 

If $X \xrightarrow{\ f\ } Y$ is any mapping, provided $X$ is nonempty, there
exists $Y \xrightarrow{\ g\ } X$ for which $fgf = f$.


Exercise 4.34 

Show that the principle just enunciated is equivalent to the axiom of choice. (What 
do you need to assume about the category to prove this equivalence?) 


Proposition 4.35: If $fgf = f$ and $f$ is epimorphic, then $fg = 1_Y$ (i.e., $g$ is a section
for $f$).

Proof: $fgf = 1_Y f$ and $f$ is right-cancellable. ■

Proposition 4.36: If $fgf = f$ and $f$ is monomorphic, then $gf = 1_X$ (i.e., $g$ is a
retraction for $f$).

Proof: Exercise. ■ 
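
For finite sets a map $g$ with $fgf = f$ can be written down directly: on a value of $f$ choose some element mapping to it, and elsewhere use an arbitrary default, which is where the nonemptiness of $X$ enters. This is a sketch with our own naming (`pseudo_section`), not the text's construction.

```python
def pseudo_section(X, Y, f):
    """A map g : Y -> X with f g f = f, for a finite nonempty X (maps as dicts)."""
    default = X[0]                   # uses that X is nonempty
    g = {y: default for y in Y}      # arbitrary value off the image of f
    for x in X:
        g[f[x]] = x                  # on the image, remember some x with f(x) = y
    return g

X, Y = [0, 1, 2, 3], ["a", "b", "c"]
f = {0: "a", 1: "a", 2: "b", 3: "b"}
g = pseudo_section(X, Y, f)
```

When `f` happens to be surjective the same `g` is a section, and when `f` is injective it is a retraction, in line with Propositions 4.35 and 4.36.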

It will be recalled that we proved earlier that if $X \xrightarrow{\ f\ } Y$ is monic and $X \neq 0$, then
$f$ has a retraction using only the existence of complements for all parts and that
any nonempty set has an element; thus, this monic case of the $fgf = f$ principle
does not involve “choice” functions. The $fgf = f$ principle can be used to derive
the main properties of the image factorization of a mapping $f$ (which we illustrate
with external and internal diagrams)

$$X \xrightarrow{\ p\ } I \xrightarrow{\ i\ } Y \qquad f = ip, \quad i \text{ injective}, \quad p \text{ surjective}$$

for any $f$ by strengthening it to a split image (meaning $p$ has a splitting or section
$s$):

$$f = ip, \quad i \text{ injective}, \quad ps = 1_I$$


Exercise 4.37

If the factorization $f = ip$ is a split image for $f$, then $i$ is the part of $Y$ whose
members are exactly all those elements of $Y$ that are values of $f$; i.e.,

$$\forall\, T \xrightarrow{\ y\ } Y \quad [\, y \in i \iff \exists x\, [y = fx] \,]$$

◊


The section $s$ in the split-image concept is not determined by $f$ (even though the
image factorization $f = ip$ is essentially determined), but on the other hand it makes
some calculations somewhat easier. The next few exercises are concerned with
showing how split images can be derived from the pseudosections $g$ for arbitrary
nonempty $X$.


Exercise 4.38 

If $fgf = f$, we can explicitly define (by composition) another “better” $\bar{g}$ for which
both

$$f\bar{g}f = f \quad \text{and} \quad \bar{g}f\bar{g} = \bar{g}$$

The improvement process goes no further: $\bar{\bar{g}} = \bar{g}$; i.e. if $g$ already satisfies both
equations, the process $(\bar{\ })$ will not change it. ◊


Exercise 4.39

If $f = ip$, where $p$ has a section $s$ and $i$ has a retraction $r$, and if we define $g \overset{\text{def}}{=} sr$,
then the resulting $g$ will already satisfy both $fgf = f$ and $gfg = g$. ◊


Exercise 4.40

Suppose $fgf = f$ and let $i$ be an equalizer of $fg$ with $1_Y$, where $Y$ is the codomain
of $f$. Show, using the universal mapping property of equalizers, that a unique
map $p$ exists for which $f = ip$. Show that $pgi = 1_E$ and hence that if we define
$s \overset{\text{def}}{=} gi$, $r \overset{\text{def}}{=} pg$ we get that $ps = 1_E$ and $ri = 1_E$. Moreover $ir = fg$ and $sp = gf$.
How must $g$ be related to $f$ in order to have further that $g = sr$? ◊


Although most categories do not have split images, often they do have image
factorization $f = ip$ in the sense that $i$ is the smallest part of the codomain through
which $f$ factors and $p$ is the proof that $f$ belongs to $i$. (It then follows that $p$ is an
epimap.)

With images available we can define a composition of relations and give another
characterization of transitive relations.

If $R \xrightarrow{\langle p_0, p_1 \rangle} X \times Y$ is a relation from $X$ to $Y$ and $S \xrightarrow{\langle q_0, q_1 \rangle} Y \times Z$ is a relation
from $Y$ to $Z$, we denote by $S \times_Y R$ the pullback of $p_1$ and $q_0$; thus, we have

[Diagram: the pullback $S \times_Y R$ with projections $p_2$ to $R$ and $q_2$ to $S$, over $p_1 : R \to Y$ and $q_0 : S \to Y$]

Definition 4.41: The relational composite of $R$ and $S$ is the image
$S \circ R = I_{\langle p_0 p_2,\, q_1 q_2 \rangle}$ of the mapping $\langle p_0 p_2, q_1 q_2 \rangle : S \times_Y R \to X \times Z$.

Note that the graph $\gamma_f$ of a mapping $X \xrightarrow{\ f\ } Y$ is a relation from $X$ to $Y$.

Exercise 4.42

If $X \xrightarrow{\ f\ } Y$ and $Y \xrightarrow{\ g\ } Z$ then $\gamma_{gf} = \gamma_g \circ \gamma_f$. ◊

Exercise 4.43

A relation $R \xrightarrow{\langle p_0, p_1 \rangle} X \times X$ on $X$ is transitive iff $R \circ R \subseteq_{X \times X} R$. ◊
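
For finite sets the pullback-then-image recipe of Definition 4.41 collapses to a familiar formula, sketched below with relations as Python sets of pairs. The names `compose_rel`, `graph`, and `is_transitive` are ours, and the examples only illustrate the two exercises on particular data.

```python
def compose_rel(S, R):
    """S o R: pairs (x, z) for which some y has (x, y) in R and (y, z) in S."""
    # The matching pairs form the pullback; collecting (x, z) into a set takes the image.
    return {(x, z) for (x, y) in R for (y2, z) in S if y == y2}

def graph(f):
    """The graph of a map stored as a dict, viewed as a relation."""
    return {(x, f[x]) for x in f}

def is_transitive(R):
    """R o R is contained in R."""
    return compose_rel(R, R) <= R

f = {1: "a", 2: "b"}
g = {"a": "X", "b": "X"}
gf = {x: g[f[x]] for x in f}
```

On this data the graph of the composite `gf` coincides with the relational composite of the two graphs.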


4.6 The Axiom of Choice as the Distinguishing Property 
of Constant/Random Sets 

Much of mathematics revolves around determining whether specific maps $f$ have
sections $s$ or not. In a general way this is because the sets of mathematical interest
have some variation/cohesion within them and the maps (such as $s$) in the
corresponding categories are accordingly not “arbitrary”. However, for the category S
of constant sets and arbitrary maps, the axiom of choice is usually assumed to hold.
The name “choice” comes from the following observation: A map $X \xrightarrow{\ f\ } Y$ can be
viewed as a family of sets

$$X_y = \{x \mid fx = y\}$$

parameterized by the elements $y$ of $Y$. Then a section $s$ for $f$ is a single rule that
chooses one element from each set in the family:

$$fs = 1_Y \;\Rightarrow\; f(sy) = y \;\Rightarrow\; s(y) \in X_y \quad \text{for each } y$$

The notion of part (including membership in a part and part defined by a condition),
as just used in the explanation of the word “choice,” can itself be entirely explained
in terms of composition of mappings (without assuming rigid membership chains
or imposing an a priori model for every part).

The axiom of choice implies two other properties (which will not both be true of
genuinely variable sets), both of which can be stated in terms of parts. The Boolean
property states that for any part $i_0 : A_0 \to X$ of any set $X$ there is another part
$i_1 : A_1 \to X$ of $X$ (the complement of $A_0$) such that $X$ is the coproduct of $A_0$ and
$A_1$, i.e. for all $Y$, for any pair of arrows $f_0 : A_0 \to Y$, $f_1 : A_1 \to Y$ with codomain
$Y$ there exists a unique $f : X \to Y$, making the diagram below commutative:

[Diagram: $A_0 \xrightarrow{i_0} X \xleftarrow{i_1} A_1$ with $f_0$, $f_1$, and $f$ into $Y$, so that $f i_0 = f_0$ and $f i_1 = f_1$]

It can be proved that the Boolean property follows from the axiom of choice.
Now we only point out the randomness implied by the Boolean property: The maps
$f_0$, $f_1$ can be specified independently (no compatibility required on any “boundary”
between $A_0$, $A_1$) and still be part of a single map (on the whole $X$) that the Boolean
category allows; moreover, the uniqueness of the extended map $f$ shows that $A_0$, $A_1$
together exhaust $X$ (for if there were any additional part, $f$ could be varied on it
without disturbing the condition that $f$ restricts to the given $f_0$, $f_1$).
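
In the category of finite sets the Boolean property is easy to exhibit: the complement of a part is computed elementwise, and the extended map is obtained by gluing. The sketch below uses our own names (`complement`, `copair`) and a hypothetical six-element example.

```python
def complement(X, A0):
    """The part A1 of X complementary to the part A0."""
    return [x for x in X if x not in A0]

def copair(A0, A1, f0, f1):
    """The map f on X = A0 + A1 that restricts to f0 on A0 and to f1 on A1."""
    f = {a: f0[a] for a in A0}
    f.update({a: f1[a] for a in A1})   # no compatibility condition is needed
    return f

X = list(range(6))
A0 = [0, 2, 4]
A1 = complement(X, A0)
f0 = {a: "e" for a in A0}
f1 = {a: "o" for a in A1}
f = copair(A0, A1, f0, f1)
```

Since `A0` and `A1` are disjoint and together exhaust `X`, the values of `f` are forced at every element, which is the uniqueness half of the property.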

There are many important categories of variable sets that satisfy the Boolean 
property without satisfying the axiom of choice. The most important such examples 
are determined by groups, and in Chapter 10 we will study enough group theory to 
understand in a general way how these examples operate. 

The other very restrictive consequence of the axiom of choice is sometimes
called the localic property or the property that “parts of 1 separate”. Recall from
Exercise 2.44 that in S/X the terminal object has many parts in general, and indeed
they separate maps in that category. In all of our categories there turn out to be
suitable objects $B$ such that “parts of $B$ separate,” meaning that there are enough
elements whose domains are (domains of) parts of $B$ to distinguish maps between
any $X$, $Y$:

$$f_0 \neq f_1 : X \rightrightarrows Y \;\Rightarrow\; \exists A\ \exists i\ \exists x\ [\, i \text{ is mono and } f_0 x \neq f_1 x \,] \qquad\qquad B \xleftarrow{\ i\ } A \xrightarrow{\ x\ } X$$

A category of sets is called localic if this holds for $B = 1$.

It can be shown that the localic property follows from the axiom of choice.
Although most of the examples of categories of variable sets are not localic, there are
a great many that are localic even without satisfying the axiom of choice. The most
important localic examples correspond to topological spaces. In Exercise 10.27 we
will learn enough about topological spaces to comprehend how sets varying over
a topological parameter space “restrict” from one open region to a smaller one.
There are a great many groups and a great many topological spaces, and each
one of each gives an example of a category of variable sets. But examples that are 
both localic and Boolean and yet do not satisfy the axiom of choice are harder to 
come by and indeed have only been known since Cohen’s 1963 discoveries. The 
study of such “independence proofs” is a part of the multifaceted lore of variable 
sets. 


4.7 Additional Exercises 

Exercise 4.44 

Prove Proposition 4.20. 

Exercise 4.45 

(a) Show that the category of finite-dimensional vector spaces has finite colimits. 

(b) Show that in the category of finite-dimensional vector spaces epimorphisms
are surjective linear transformations and, moreover, any epimorphism has a
section.

Exercise 4.46

(a) Show that in a slice category S/X a morphism $h$ from $f : A \to X$ to
$g : B \to X$ is an epimorphism iff $h$ is a surjective mapping in S.

(b) Show that the slice categories S/X have finite colimits.

Hint: They are computed using colimits in S.

Exercise 4.47 

(a) Show that epimorphisms in the category of M-sets are equivariant mappings 
that are surjective. 

(b) Show that these categories have finite colimits. 

Hint: They are computed using colimits in S. 

Exercise 4.48 

(a) Show that the category of partially ordered sets has finite colimits. 

(b) Show that epimorphisms in the category of partially ordered sets are surjective 
(order-preserving) mappings. 

Exercise 4.49

Describe equivalence relations in the categories S/X. 

Exercise 4.50 

Describe equivalence relations in the categories of M-sets. 

Exercise 4.51 

(a) Describe equivalence relations in the category of groups. In the category of 
groups equivalence relations are called congruences. The congruences on a 
group correspond bijectively to the normal subgroups. 

(b) Show that the analogue of Proposition 4.20 holds in the category of groups 
(where it is usually called the First Isomorphism theorem.) 

Exercise 4.52 

Show that the axiom of choice holds in the categories S/X. 

Exercise 4.53 

Show that the axiom of choice does not hold in the category of groups. 

Exercise 4.54 

Show that the axiom of choice does not hold in general in the category of M-sets. 

Hint: There is a very small counterexample using the action of a one-letter alphabet 
on sets with at most two elements. In fact, it can be shown that the axiom of choice 
fails in M-sets unless M is the monoid with only one element (in which case M-sets 
are equivalent to S). 

Exercise 4.55 

On the other hand, if a monoid has an inverse for every element (i.e., is a group G), 
then as mentioned in Section 4.6 the Boolean property holds for G-sets. 

Hint: Consider the nature of monomorphisms in G-sets; they are rather special. 

Mapping Sets and Exponentials


5.1 Natural Bijection and Functoriality 

The essential properties of the product operation can be summed up by the figure 

\[
(P)\qquad \frac{X \longrightarrow Y_0 \times Y_1}{X \longrightarrow Y_0,\quad X \longrightarrow Y_1}\ \downarrow\uparrow
\]


where the horizontal bar will be interpreted in such contexts to mean there is a 
natural process that, to every arrow of the type indicated above the bar, assigns one 
of the type indicated below the bar, and there is also a natural process from below 
to above, and these two natural processes are inverse to each other in the sense that 
following one process by the other gives nothing new. Of course, in the example of 
products the two processes in question are supposed to be 


(1) taking the components of a map whose codomain is a product, and 

(2) “pairing.” 


Just from the idea that there should be such a couple of inverse processes, we can 
(by a “bootstrap” procedure) discover more specifically how the processes must 
work. Namely, from the presumption that $P$ should function for all $X$, $Y_0$, $Y_1$, and 
for all maps as indicated, we can deduce two special cases: Let $Y_0$, $Y_1$ be arbitrary 
but suppose $X = Y_0 \times Y_1$; then, we make the very special choice “above the bar” of 
$1_{Y_0 \times Y_1}$, which will correspond to something specific, namely, the projections below 
the bar 

\[
\frac{Y_0 \times Y_1 \xrightarrow{\;1_{Y_0 \times Y_1}\;} Y_0 \times Y_1}{Y_0 \times Y_1 \xrightarrow{\;p_0\;} Y_0,\quad Y_0 \times Y_1 \xrightarrow{\;p_1\;} Y_1}
\]

Returning to the general $X$, the same projections are the means by which, through 
composition, the general top-to-bottom process is effected as follows: 

\[
\frac{X \xrightarrow{\;f\;} Y_0 \times Y_1}{X \xrightarrow{\;p_0 f\;} Y_0,\quad X \xrightarrow{\;p_1 f\;} Y_1}\ \downarrow
\]

Of course the bottom-to-top process is usually just indicated by pairing (the result 
of which is often denoted by brackets), 

\[
\frac{X \xrightarrow{\;\langle f_0, f_1\rangle\;} Y_0 \times Y_1}{X \xrightarrow{\;f_0\;} Y_0,\quad X \xrightarrow{\;f_1\;} Y_1}\ \uparrow
\]

but this can also be analyzed further: First consider the special case in which $X$ is 
arbitrary but both $Y_0 = X$ and $Y_1 = X$; then, we have the possibility of considering 
the very special pair of maps below the bar, which by the pairing process will lead 
to a specific map 

\[
\frac{X \xrightarrow{\;\delta_X\;} X \times X}{X \xrightarrow{\;1_X\;} X,\quad X \xrightarrow{\;1_X\;} X}
\]

known as the diagonal $\delta_X = \langle 1_X, 1_X\rangle$. The diagonal helps via composition to effect 
the general case of the pairing process provided one first develops the “functoriality” 
of product (which can also be regarded as deduced from the basic $P$): 

If $X_0 \xrightarrow{f_0} Y_0$, $X_1 \xrightarrow{f_1} Y_1$ are any two mappings, there is an induced map 

\[ X_0 \times X_1 \xrightarrow{\;f_0 \times f_1\;} Y_0 \times Y_1 \]

called the (Cartesian) product of the two maps that is characterized by the equations 

\[ p_k (f_0 \times f_1) = f_k\, p_k, \qquad k = 0, 1 \]

or, in terms of values, by 

\[ (f_0 \times f_1)\langle x_0, x_1\rangle = \langle f_0 x_0, f_1 x_1\rangle \]

Exercise 5.1 

(“Functoriality of product”) If $Y_0 \xrightarrow{g_0} Z_0$, $Y_1 \xrightarrow{g_1} Z_1$ are further maps, then 

\[ (g_0 \times g_1)(f_0 \times f_1) = (g_0 f_0) \times (g_1 f_1) \]

◊ 

Now the analysis of the pairing in terms of the product of maps and composition 
with $\delta_X$ is just this: 

If $X_0 = X$, $X_1 = X$ and if $X \xrightarrow{f_0} Y_0$, $X \xrightarrow{f_1} Y_1$, then 

\[ \langle f_0, f_1\rangle = (f_0 \times f_1)\,\delta_X \]

as is easily checked by following both sides of the equation with the projections. 
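
The factorization of pairing through the diagonal can be checked on a small example. The following is only a sketch on finite data: the helper names (`pair`, `product_of_maps`, `diagonal`, `compose`) and the sample maps are illustrative assumptions, not notation from the text.

```python
def pair(f0, f1):
    """Pairing <f0, f1> : X -> Y0 x Y1."""
    return lambda x: (f0(x), f1(x))

def product_of_maps(f0, f1):
    """Cartesian product f0 x f1 : X0 x X1 -> Y0 x Y1."""
    return lambda xs: (f0(xs[0]), f1(xs[1]))

def diagonal(x):
    """delta_X = <1_X, 1_X> : X -> X x X."""
    return (x, x)

def compose(g, f):
    """Composite g f (first f, then g)."""
    return lambda x: g(f(x))

# Hypothetical sample data: X = {0,...,4}, f0 squares, f1 tests evenness.
X = range(5)
f0 = lambda x: x * x
f1 = lambda x: x % 2 == 0

lhs = pair(f0, f1)                                   # <f0, f1>
rhs = compose(product_of_maps(f0, f1), diagonal)     # (f0 x f1) delta_X
pairing_agrees = all(lhs(x) == rhs(x) for x in X)
print(pairing_agrees)
```

Checking values on a finite set illustrates the equation but does not replace the argument by projections given above.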

This two-level method, applied here to summarize the transformations possible 
with the product construction, will be applied to many different constructions later, 
so let us restate the levels: The crude idea that there should be a natural invertible 
process $\downarrow\uparrow$ between maps of two kinds is refined to a precise suggestion of how 
the process can be carried out, namely, to apply a functorial construction and com- 
pose with a specific map; a relationship between the two levels is essentially that 
the crude idea of the invertible process, applied to a very special case and to the very 
special identity map, yields that specific map, which can be used in composition to 
help effect the process. 


5.2 Exponentiation 

We will now discuss another important construction, that of exponentiation or 
mapping set, whose description involves a strictly analogous two-tier process for 
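objects $X$, $Y$ and $B$: 

\[
\frac{X \longrightarrow Y^B}{X \times B \longrightarrow Y}\ \downarrow\uparrow
\]

Immediately we can derive from this crude idea what the elements of $Y^B$ must be 
by considering the special case $X = 1$: The process 

\[
\frac{1 \longrightarrow Y^B}{B \longrightarrow Y}\ \downarrow\uparrow
\]

must be an invertible process (recall from Exercise 3.22 that $1 \times B \cong B$), i.e. there 
must be just as many elements of $Y^B$ as there are actual maps $B \longrightarrow Y$. 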

Now if we are given any map 

\[ X \times B \xrightarrow{\;f\;} Y \]

whose domain is equipped with a product structure (frequently one refers to such 
$f$ as a function of two variables), then for each element $1 \xrightarrow{x} X$ of $X$ we can 
consider (with $\bar{x}$ the constant composite: $B \longrightarrow 1 \xrightarrow{x} X$) 

\[ B \xrightarrow{\;\langle \bar{x}, 1_B\rangle\;} X \times B \xrightarrow{\;f\;} Y, \qquad f_x = f\,\langle \bar{x}, 1_B\rangle \]

In other words $f_x$ is the map $B \longrightarrow Y$ whose value at any element $b$ is 

\[ f_x(b) = f(x, b) \]

That is, such a single map $f$ on a product $X \times B$ gives rise to a family, parameterized 
by $X$, of maps $B \longrightarrow Y$. Depending on the size of $X$ and on the nature of $f$, one 
may have (but usually will not have) the following two properties: 

(1) Every map $B \longrightarrow Y$ occurs as $f_x$ for at least one $1 \xrightarrow{x} X$ (i.e., every $B \longrightarrow Y$ 
has at least one “name” provided by the scheme $X$, $f$). 

(2) $f_{x_1} = f_{x_2}$ only when $x_1 = x_2$ (i.e., “nameable” maps $B \longrightarrow Y$ have only one 
“name”). 

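In case both properties are true, one often writes $Y^B$ in place of $X$ and calls the map 
$Y^B \times B \xrightarrow{\mathrm{eval}} Y$ evaluation (rather than calling it $f$). For any $B \xrightarrow{f} Y$ let $1 \xrightarrow{\ulcorner f \urcorner} Y^B$ 
be the unique element of $Y^B$ guaranteed by the properties (1) and (2); then we have 
for all $f$, $b$ 

\[ B \xrightarrow{\;\langle \overline{\ulcorner f \urcorner}, 1_B\rangle\;} Y^B \times B \xrightarrow{\;\mathrm{eval}\;} Y, \qquad \mathrm{eval}(\ulcorner f \urcorner, b) = f b \]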


Such a universal map-parameterizing scheme (one which does enjoy both properties 
(1) and (2)) will have a unique relationship to any map-parameterizing scheme $X$, $f$ 
(in which $X \times B \xrightarrow{f} Y$ and any $1 \xrightarrow{x} X$ names $B \xrightarrow{f_x} Y$), as expressed by the 
natural invertible process 

\[
\frac{X \xrightarrow{\;\ulcorner f \urcorner\;} Y^B}{X \times B \xrightarrow{\;f\;} Y}\ \downarrow\uparrow
\]

namely, 

\[ \ulcorner f \urcorner x = \ulcorner f_x \urcorner \]

Thus, this extended naming process is related to the evaluation map by 

\[ \mathrm{eval}(\ulcorner f \urcorner x, b) = f(x, b) \]

or 

\[ \mathrm{eval}\,(\ulcorner f \urcorner \times 1_B) = f \]

For any $f$, $\ulcorner f \urcorner$ is the only map for which the latter equation is true, and thus we 
can say briefly that any $f$ is uniquely derivable from the evaluation map. 
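
For finite sets the naming process and the evaluation map can be written out concretely. The sketch below assumes one particular representation (an element of the mapping set is stored as the tuple of its values, in the order of B); the names `name`, `make_eval` and the sample map `f` are hypothetical.

```python
from itertools import product

def name(f, B):
    """Transpose f : X x B -> Y to the map X -> Y^B.

    An element of Y^B is stored as the tuple of its values, in the order of B.
    """
    return lambda x: tuple(f(x, b) for b in B)

def make_eval(B):
    """Evaluation Y^B x B -> Y for the tuple representation above."""
    position = {b: i for i, b in enumerate(B)}
    return lambda g, b: g[position[b]]

# Hypothetical sample data.
X = [0, 1, 2]
B = ["p", "q"]
f = lambda x, b: 2 * x + (1 if b == "q" else 0)

name_f = name(f, B)
ev = make_eval(B)

# eval(name_f(x), b) == f(x, b) for every x and b: f is recovered from eval.
recovers_f = all(ev(name_f(x), b) == f(x, b) for x, b in product(X, B))
# In this particular example every x receives a different name.
names_distinct = len({name_f(x) for x in X}) == len(X)
print(recovers_f, names_distinct)
```

Here the scheme X, f has property (2) but not property (1): only three of the possible maps from B to the integers receive names.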

As an example imagine that the elements of B represent particles of a continuous 
body of matter (such as a cloud); also imagine that a set E represents the points of 
ordinary space and a set T represents the instants of time within a certain interval. 
Then, 

\[ E^B = \text{placements of } B \]

is a set, each element of which determines an entire map $B \longrightarrow E$ telling where in 
$E$ each particle of $B$ is (i.e., a placement of $B$ in $E$). Thus, a motion of $B$ during 
the time interval $T$ could be considered as a mapping 

\[ T \xrightarrow{\;\ulcorner m \urcorner\;} E^B \]

whose value at any instant is a placement. 

By the fundamental transformation law for mapping sets, $\ulcorner m \urcorner$ corresponds to a 
unique 

\[ T \times B \xrightarrow{\;m\;} E \]

which represents the same motion in a different mathematical aspect: for each 
instant $t$ and particle $b$, the value $m(t, b)$ is the point in space at which the particle 
$b$ finds itself at instant $t$ during the motion. But there is still a third way to describe 
the same motion. Because of the natural twist map $B \times T \longrightarrow T \times B$, we have 
$B \times T \longrightarrow E$, and hence by a different instance of the fundamental transformation 
we find 

\[ B \longrightarrow E^T \]

which seems to describe the motion equally well. Here 

\[ E^T = \text{paths in } E \]

is a set, each element of which determines a whole map $T \longrightarrow E$ from instants 
of time to points of space (i.e., a “path”). The given motion determines, for each 
particle $b$, the whole path that $b$ follows during $T$. The expressions 

\[
\begin{array}{l}
B \longrightarrow E^T \\
B \times T \longrightarrow E \\
T \longrightarrow E^B
\end{array}
\]

are thus the three descriptions of the same motion of the cloud. The first and last 
involve the infinite-dimensional function spaces $E^T$ and $E^B$, whereas the middle 
one involves only the finite-dimensional $B \times T \cong T \times B$. 
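
The three descriptions can be compared on a toy motion. This is a sketch with made-up data: three instants, two particles, and points of E represented as pairs of integers; none of these choices come from the text.

```python
# Hypothetical sample data: three instants, two particles, points of E as pairs.
T = [0, 1, 2]
B = ["a", "b"]

def m(t, b):
    """The motion as a map T x B -> E."""
    return (t, 0) if b == "a" else (t, t * t)

# T -> E^B : at each instant, a placement of the whole body.
placement_at = {t: {b: m(t, b) for b in B} for t in T}

# B -> E^T : for each particle, its whole path.
path_of = {b: {t: m(t, b) for t in T} for b in B}

# All three descriptions carry the same information.
same_motion = all(
    placement_at[t][b] == path_of[b][t] == m(t, b) for t in T for b in B
)
print(same_motion)
```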

Although maps whose codomain is a function space (= mapping set or expo- 
nential) can always be “transformed down” in the indicated manner, nothing of the 
sort is true in general for mappings whose domain is a function space. Mappings 
whose domain is a function space are often called 

Operators or Functionals 

and include such things as differentiation and integration. An example of an 
integration functional arises when we consider the mass distribution on the body $B$ and 
the resulting map 

\[ E^B \longrightarrow E \]

whose value at any placement is the center of mass (which is a point of $E$) of that 
placement. Thus, the description of a motion by means of 

\[ T \longrightarrow E^B \]

is necessary if we want to compute by composition 

\[ T \xrightarrow{\;\ulcorner m \urcorner\;} E^B \longrightarrow E \]

the map whose typical value is the instantaneous position of the center of mass. On 
the other hand, a typical differentiation operator is the “velocity” 

\[ E^T \longrightarrow V^T \]

whose value at any path of points is the corresponding path of velocity vectors. If 
we are to calculate the velocity field resulting from a particular motion $m$ of an 
extended body $B$, the appropriate description of the motion will be 

\[ B \longrightarrow E^T \]

for then we can just calculate by composition 

\[ B \xrightarrow{\;\text{motion}\;} E^T \xrightarrow{\;\text{differentiate}\;} V^T \]

and then apply once again the fundamental transformation to get 

\[ T \longrightarrow V^B \]

whose value at any instant is the velocity field over the body. 
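
A discrete caricature may help show why each functional needs its own description of the motion. The sketch below rests on several assumptions that are not in the text: space is one-dimensional, all particles have equal mass, and "velocity" is replaced by forward differences between consecutive instants.

```python
# Hypothetical sample data: four instants, two particles, one-dimensional space.
T = [0, 1, 2, 3]
B = ["a", "b"]

def m(t, b):
    """The motion as a map T x B -> E."""
    return float(t) if b == "a" else float(t * t)

# Description T -> E^B, composed with a functional E^B -> E.
placement = {t: {b: m(t, b) for b in B} for t in T}

def center_of_mass(p):
    """Functional E^B -> E (equal masses assumed)."""
    return sum(p.values()) / len(p)

com_path = {t: center_of_mass(placement[t]) for t in T}

# Description B -> E^T, composed with an operator on paths.
path = {b: [m(t, b) for t in T] for b in B}

def velocity(p):
    """Stand-in for differentiation: forward differences along a path."""
    return [p[i + 1] - p[i] for i in range(len(p) - 1)]

vel = {b: velocity(path[b]) for b in B}

# Transposing once more: for each time step, a velocity field over the body.
velocity_field = [{b: vel[b][i] for b in B} for i in range(len(T) - 1)]
print(com_path, velocity_field)
```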


5.3 Functoriality of Function Spaces 

We will take as an axiom the following: 

AXIOM: EXPONENTIATION AXIOM 

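For any two sets $B$, $Y$ there exists a set $Y^B$ and an evaluation $Y^B \times B \xrightarrow{\mathrm{eval}} Y$ having 
the universal property that for each $X$ and for each $X \times B \xrightarrow{f} Y$ there is a unique 
$X \xrightarrow{\ulcorner f \urcorner} Y^B$ for which $\mathrm{eval}\,(\ulcorner f \urcorner \times 1_B) = f$. Briefly, 

\[
\frac{X \xrightarrow{\;\ulcorner f \urcorner\;} Y^B}{X \times B \xrightarrow{\;f\;} Y}
\]

with the evaluation map arising as the unique $f$ for which $\ulcorner f \urcorner = 1_{Y^B}$. 

Proposition 5.2: If $Y_1 \xrightarrow{\varphi} Y_2$ then there is a unique $Y_1^B \xrightarrow{\varphi^B} Y_2^B$ for which 

\[ \varphi^B \ulcorner g \urcorner = \ulcorner \varphi g \urcorner \]

for all $g$, where $B \xrightarrow{g} Y_1 \xrightarrow{\varphi} Y_2$. 

Proof: The axiom says that to construct a map whose codomain is a function space 
$Y_2^B$ it suffices to construct the corresponding map whose domain is a product, in 
our case $Y_1^B \times B$. But this is easily done as the following composition: 

\[
\frac{Y_1^B \times B \xrightarrow{\;\mathrm{eval}_1\;} Y_1 \xrightarrow{\;\varphi\;} Y_2}{Y_1^B \xrightarrow{\;\varphi^B\;} Y_2^B}
\]

Notice that this means that $\varphi^B = \ulcorner \varphi\, \mathrm{eval}_1 \urcorner$ by the axiom, and thus 

\[ \mathrm{eval}_2 (\varphi^B \times 1_B) = \varphi\, \mathrm{eval}_1 \]

Furthermore $(\varphi^B \times 1_B)(\ulcorner g \urcorner \times 1_B) = \varphi^B \ulcorner g \urcorner \times 1_B$. Thus, 

\[
\begin{aligned}
\mathrm{eval}_2 (\varphi^B \ulcorner g \urcorner \times 1_B) &= \mathrm{eval}_2 (\varphi^B \times 1_B)(\ulcorner g \urcorner \times 1_B) \\
&= \varphi\, \mathrm{eval}_1 (\ulcorner g \urcorner \times 1_B) = \varphi g \\
&= \mathrm{eval}_2 (\ulcorner \varphi g \urcorner \times 1_B)
\end{aligned}
\]

That is, both $\varphi^B \ulcorner g \urcorner$ and $\ulcorner \varphi g \urcorner$ give the same result when crossed with $1_B$ and then 
composed with $\mathrm{eval}_2$. Since the axiom states the uniqueness of maps yielding a 
given result under this two-step transformation, we conclude that they are equal, as 
Proposition 5.2 called for. ■ 


Remark: Note that the argument works just as well if $\ulcorner g \urcorner$ is a generalized element 
of $Y_1^B$, that is, for $X \times B \xrightarrow{g} Y_1$. 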
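
On finite sets the induced map of Proposition 5.2 is just composition with the given map on the codomain side. The sketch below stores maps as dictionaries and checks that inducing respects composition for one pair of composable maps; the names `all_maps`, `induced`, `compose` and the sample sets are assumptions made for illustration.

```python
from itertools import product

def all_maps(dom, cod):
    """All maps dom -> cod, each stored as a dict."""
    return [dict(zip(dom, values)) for values in product(cod, repeat=len(dom))]

def induced(phi):
    """phi^B : Y1^B -> Y2^B, sending g to the composite phi g."""
    return lambda g: {b: phi[y] for b, y in g.items()}

def compose(psi, phi):
    """Composite psi phi of maps stored as dicts."""
    return {y: psi[phi[y]] for y in phi}

# Hypothetical sample data.
B = ["b0", "b1"]
Y1 = [0, 1, 2]
phi = {0: "u", 1: "v", 2: "v"}        # Y1 -> Y2
psi = {"u": True, "v": False}         # Y2 -> Y3

# Inducing on the composite agrees with composing the induced maps.
functorial = all(
    induced(compose(psi, phi))(g) == induced(psi)(induced(phi)(g))
    for g in all_maps(B, Y1)
)
print(functorial)
```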


Exercise 5.3 

If $Y_1 \xrightarrow{\varphi} Y_2 \xrightarrow{\psi} Y_3$, then 

\[ (\psi\varphi)^B = \psi^B \varphi^B \]

◊ 


The preceding exercise establishes the “covariant functoriality of induced maps 
on function spaces” with given domain B. But if we fix the codomain space Y and 
instead let the domain space B vary along maps, we find that there are again induced 
maps on the function spaces, but of a contravariant nature, in the following sense: 


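Proposition 5.4: If $B_2 \xrightarrow{\beta} B_1$ then there is a unique 

\[ Y^{B_1} \xrightarrow{\;Y^\beta\;} Y^{B_2} \]

for which 

\[ Y^\beta \ulcorner g \urcorner = \ulcorner g\beta \urcorner \]

for all $g$. 


Proof: Again we construct by composition the map $Y^{B_1} \times B_2 \longrightarrow Y$ that uniquely 
corresponds by the fundamental transformation of the axiom to the $Y^\beta$ desired: 

\[ Y^{B_1} \times B_2 \xrightarrow{\;1 \times \beta\;} Y^{B_1} \times B_1 \xrightarrow{\;\mathrm{eval}_1\;} Y \]

This can be done along the lines of the proof of Proposition 5.2. ■ 

Proposition 5.5: If $B_3 \xrightarrow{\alpha} B_2 \xrightarrow{\beta} B_1$, then 

\[ Y^{\beta\alpha} = Y^\alpha\, Y^\beta \]


Proof: The proof of the proposition is simply a “higher” expression of the 
associativity of composition in 

\[ B_3 \xrightarrow{\;\alpha\;} B_2 \xrightarrow{\;\beta\;} B_1 \xrightarrow{\;g\;} Y \]

One way $Y^\beta \ulcorner g \urcorner = \ulcorner g\beta \urcorner$; hence, 

\[ Y^\alpha\, Y^\beta \ulcorner g \urcorner = Y^\alpha \ulcorner g\beta \urcorner = \ulcorner (g\beta)\alpha \urcorner \]

but the other way 

\[ Y^{\beta\alpha} \ulcorner g \urcorner = \ulcorner g(\beta\alpha) \urcorner \]

hence the two are equal for each $g$, which accounts for the conclusion. ■ 

In case $B_2 \xrightarrow{\beta} B_1$ is injective, then $Y^{B_1} \xrightarrow{Y^\beta} Y^{B_2}$ is often called the operator of 
restriction (of maps, restriction to the smaller domain along the part of their 
original domain). 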
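
The contravariant induced map can likewise be sketched on finite sets as precomposition. In the example below maps are stored as dictionaries; the sets, the injective map `beta`, and the helper names are hypothetical choices made only to illustrate Propositions 5.4 and 5.5 and the restriction operator.

```python
from itertools import product

def all_maps(dom, cod):
    """All maps dom -> cod, each stored as a dict."""
    return [dict(zip(dom, values)) for values in product(cod, repeat=len(dom))]

def induced_contra(beta):
    """Y^beta : Y^{B1} -> Y^{B2} for beta : B2 -> B1, sending g to g beta."""
    return lambda g: {b2: g[b1] for b2, b1 in beta.items()}

# Hypothetical sample data: B3 --alpha--> B2 --beta--> B1.
B1 = [0, 1, 2]
Y = ["u", "v"]
beta = {"s": 0, "t": 2}          # injective map B2 -> B1
alpha = {"only": "t"}            # map B3 -> B2
beta_alpha = {b3: beta[b2] for b3, b2 in alpha.items()}

# Y^(beta alpha) agrees with Y^alpha Y^beta on every g : B1 -> Y.
contravariant = all(
    induced_contra(beta_alpha)(g) == induced_contra(alpha)(induced_contra(beta)(g))
    for g in all_maps(B1, Y)
)

# Since beta is injective, Y^beta restricts g to the part {0, 2} of B1.
restricted = induced_contra(beta)({0: "u", 1: "v", 2: "v"})
print(contravariant, restricted)
```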

The next proposition shows that the operation on mappings that we take as most 
fundamental, namely composition, can itself be expressed via an actual mapping - 
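at least when the three sets involved are fixed. 


Proposition 5.6: There is a map $Y_2^{Y_1} \times Y_1^B \xrightarrow{c} Y_2^B$ such that 

\[ c(\ulcorner \varphi \urcorner, \ulcorner g \urcorner) = \ulcorner \varphi g \urcorner \]

for all 

\[ B \xrightarrow{\;g\;} Y_1 \xrightarrow{\;\varphi\;} Y_2 \]


Proof: Define $c$ as the map corresponding by the axiom to the composite 

\[ Y_2^{Y_1} \times Y_1^B \times B \xrightarrow{\;1 \times \mathrm{eval}\;} Y_2^{Y_1} \times Y_1 \xrightarrow{\;\mathrm{eval}\;} Y_2 \]

Note that $c$ can be further transposed by another instance of the axiom to give 

\[ Y_2^{Y_1} \longrightarrow (Y_2^B)^{(Y_1^B)} \]

whose value at any $\ulcorner \varphi \urcorner$ is the name $\ulcorner \varphi^B \urcorner$ of the (covariantly) induced map on the 
function spaces with domain (= exponent) $B$. That is, the inducing operation is 
itself represented by a mapping. In a similar fashion (taking a differently labeled 
instance of $c$) the third way (as in our example of motion) yields 

\[ B_1^{B_2} \longrightarrow (Y^{B_2})^{(Y^{B_1})} \]

representing the contravariant-inducing process by an actual map. 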

Recalling that the set 2 represents, via the notion of characteristic function, the 
arbitrary parts of an arbitrary set $B$, 

\[
\frac{B \longrightarrow 2}{\cdot \hookrightarrow B}
\]

we can combine this bijection with the function-space axiom to get 

\[
\frac{1 \longrightarrow 2^B}{\cdot \hookrightarrow B}
\]

so that the elements of the set $2^B$ serve as effective names for the arbitrary parts 
of $B$ and, in particular, the “number” of parts can be defined to be the number of 
elements of $2^B$. Moreover, the meaning of the evaluation map in this case is just 
“membership”: 

\[ 2^B \times B \xrightarrow{\;\mathrm{eval}\;} 2, \qquad \mathrm{eval}(\ulcorner \varphi \urcorner, b) = \varphi b \]

thus, if $\varphi$ is the characteristic function of a part $i$, we could define 

\[ [b \in i] = \mathrm{eval}(\ulcorner \varphi \urcorner, b) \]

as the truth value of the statement that $b$ is a member of $i$. 
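
For a small finite set this naming of parts can be enumerated directly. In the sketch below an element of the mapping set into 2 is stored as a tuple of truth values in the order of B; that representation and the helper names are assumptions of the example.

```python
from itertools import product

B = ["a", "b", "c"]

# An element of 2^B: one truth value per element of B, in the order of B.
names = list(product([False, True], repeat=len(B)))

def ev(phi, b):
    """Evaluation 2^B x B -> 2, read as the truth value of 'b is in the part'."""
    return phi[B.index(b)]

def part_named_by(phi):
    """The part of B whose characteristic function is phi."""
    return frozenset(b for b in B if ev(phi, b))

parts = {part_named_by(phi) for phi in names}

# Each name gives a different part, and there are 2^3 of them.
count_matches = len(names) == len(parts) == 2 ** len(B)
print(count_matches)
```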


Exercise 5.7 

If $B_2 \xrightarrow{\beta} B_1$, then 

\[ 2^{B_1} \xrightarrow{\;2^\beta\;} 2^{B_2} \]

represents by a map the operation of taking the inverse image along $\beta$ of arbitrary 
parts of $B_1$. ◊ 

To justify the use of the exponential notation for function spaces, we will show 
that for finite sets the number of maps from $B$ to $Y$ is the numerical exponential: 
the number of elements of $Y$ to the power of the number of elements of $B$. 


Proposition 5.8: 

\[ Y^0 \cong 1, \qquad Y^{A+B} \cong Y^A \times Y^B \]

Proof: To construct $1 \longrightarrow Y^0$ is equivalent to constructing $1 \times 0 \longrightarrow Y$, but since 
$0 \times 1 \cong 1 \times 0$, that is equivalent to constructing $0 \longrightarrow Y^1$, which is unique; 
the composite $1 \longrightarrow Y^0 \longrightarrow 1$ is the identity because maps to 1 are unique, and 
the composite $Y^0 \longrightarrow 1 \longrightarrow Y^0$ must be the identity since it corresponds to a map 
$0 \longrightarrow Y^{(Y^0)}$ of which there is only one. For the second statement from the 
proposition consider the injections $i_A$, $i_B$ into the sum $A + B$; by the functoriality of $Y^{(\ )}$ 
these induce “restriction” maps $Y^{A+B} \longrightarrow Y^A$, $Y^{A+B} \longrightarrow Y^B$, which can be paired 
to yield 

\[ Y^{A+B} \xrightarrow{\;\langle Y^{i_A},\, Y^{i_B}\rangle\;} Y^A \times Y^B \]

the map mentioned in the statement, which we want to show is invertible. To 
construct an inverse we must construct first a candidate 

\[ Y^A \times Y^B \longrightarrow Y^{A+B} \]

which is equivalent to constructing 

\[ A + B \longrightarrow Y^{(Y^A \times Y^B)} \]

which must necessarily be the “copair” of two maps 

\[ A \longrightarrow Y^{(Y^A \times Y^B)}, \qquad B \longrightarrow Y^{(Y^A \times Y^B)} \]

But such a couple of maps is equivalent to 

\[ Y^A \times Y^B \longrightarrow Y^A, \qquad Y^A \times Y^B \longrightarrow Y^B \]

for which the obvious choices are the projections; tracing back through the 
equivalences, we then have our map 

\[ Y^A \times Y^B \longrightarrow Y^{A+B} \]

which we must show is the inverse. The map is the one whose value at a pair of names 
is the name of the copair, 

\[ \langle \ulcorner f_A \urcorner, \ulcorner f_B \urcorner \rangle \ \text{ goes to } \ \left\ulcorner \left\{ \begin{array}{l} f_A \\ f_B \end{array} \right. \right\urcorner \]

which is indeed clearly inverse to the previous one whose value at $\ulcorner f \urcorner$ is 

\[ \langle \ulcorner f i_A \urcorner, \ulcorner f i_B \urcorner \rangle \quad ■ \]
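
Both statements of Proposition 5.8, and the counting claim that motivates the exponential notation, can be checked by brute force on small sets. The sketch below assumes that maps are stored as dictionaries and that the sum A + B is modeled by tagging elements; the names `all_maps`, `restrict_pair`, and `copair` are illustrative.

```python
from itertools import product

def all_maps(dom, cod):
    """All maps dom -> cod, each stored as a dict."""
    return [dict(zip(dom, values)) for values in product(cod, repeat=len(dom))]

# Elements are tagged so that the list A + B really is a disjoint union.
A = [("A", 0), ("A", 1)]
B = [("B", 0), ("B", 1), ("B", 2)]
Y = ["u", "v"]
S = A + B

def restrict_pair(f):
    """Y^(A+B) -> Y^A x Y^B : restrict f along the two injections."""
    return ({a: f[a] for a in A}, {b: f[b] for b in B})

def copair(fA, fB):
    """Y^A x Y^B -> Y^(A+B) : the copair of fA and fB."""
    return {**fA, **fB}

maps = all_maps(S, Y)
count_ok = len(maps) == len(Y) ** len(S) == len(all_maps(A, Y)) * len(all_maps(B, Y))
round_trip = all(copair(*restrict_pair(f)) == f for f in maps)
other_trip = all(
    restrict_pair(copair(fA, fB)) == (fA, fB)
    for fA in all_maps(A, Y) for fB in all_maps(B, Y)
)
# Exactly one map out of the empty set.
empty_case = all_maps([], Y) == [{}]
print(count_ok, round_trip, other_trip, empty_case)
```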



Notice that the effect of Proposition 5.8 is to represent the invertible process 
characteristic of sum 

\[
\frac{A + B \longrightarrow Y}{A \longrightarrow Y,\quad B \longrightarrow Y}\ \downarrow\uparrow
\]

as an actual invertible map. In a similar way the processes characteristic of product, 
and even the processes characteristic of exponentiation itself (!), can be represented 
by actual invertible maps by allowing the exponential sets to intervene, as will be 
shown in the following two propositions. 


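Proposition 5.9: 

\[ 1^B \cong 1, \qquad (Y_0 \times Y_1)^B \cong Y_0^B \times Y_1^B \]

Proof: Since the assertions of the proposition are that certain specific maps are 
invertible, we must find these inverses. The inverse of the unique $1^B \longrightarrow 1$ is the 
“name” $1 \longrightarrow 1^B$ of the unique $B \longrightarrow 1$. The map $\langle p_0^B, p_1^B \rangle$ has an inverse, 

\[ Y_0^B \times Y_1^B \longrightarrow (Y_0 \times Y_1)^B \]

which is constructed as the map corresponding to 

\[ Y_0^B \times Y_1^B \times B \xrightarrow{\;e\;} Y_0 \times Y_1 \]

where 

\[ e(\ulcorner g_0 \urcorner, \ulcorner g_1 \urcorner, b) = \langle g_0 b, g_1 b \rangle \]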

Thus Proposition 5.9 internalizes or objectifies the defining property of the ter- 
minal object and the defining process of the product concept. 

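Proposition 5.10: 

\[ Y^1 \cong Y, \qquad Y^{X \times B} \cong (Y^B)^X \]

Proof: The diagrams 

\[
Y^1 \cong Y^1 \times 1 \xrightarrow{\;\mathrm{eval}\;} Y
\qquad\qquad
\frac{Y \longrightarrow Y^1}{Y \times 1 \longrightarrow Y}
\]

define the two maps inverse to each other for the first “equation”. For the second, 
we have 

\[
\begin{array}{c}
Y^{X \times B} \longrightarrow (Y^B)^X \\ \hline
Y^{X \times B} \times X \longrightarrow Y^B \\ \hline
Y^{X \times B} \times X \times B \xrightarrow{\;\mathrm{eval}\;} Y
\end{array}
\qquad\qquad
\begin{array}{c}
(Y^B)^X \longrightarrow Y^{X \times B} \\ \hline
(Y^B)^X \times (X \times B) \longrightarrow Y \\ \hline
((Y^B)^X \times X) \times B \xrightarrow{\;\mathrm{eval}_X \times 1_B\;} Y^B \times B \xrightarrow{\;\mathrm{eval}_B\;} Y
\end{array}
\]

Schematically, the first map is $\ulcorner f \urcorner \mapsto \ulcorner x \mapsto \ulcorner f(x, -) \urcorner \urcorner$, and its inverse is 
$\ulcorner \varphi \urcorner \mapsto \ulcorner \langle x, b \rangle \mapsto (\varphi x)(b) \urcorner$. (Do Exercise 5.16.) Thus, the invertible map of the proposition 
internalizes the process 

\[
\frac{X \times B \longrightarrow Y}{X \longrightarrow Y^B}
\]

which is the defining property of exponentiation itself. ■ 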
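
The second isomorphism of Proposition 5.10 can be checked exhaustively on small sets. This is a sketch only: maps are stored as dictionaries, and the names `curry`, `uncurry`, and `encode` are hypothetical helpers rather than notation from the text.

```python
from itertools import product

def all_maps(dom, cod):
    """All maps dom -> cod, each stored as a dict."""
    return [dict(zip(dom, values)) for values in product(cod, repeat=len(dom))]

# Hypothetical sample data.
X, B, Y = ["x0", "x1"], [0, 1], ["u", "v"]
XB = list(product(X, B))

def curry(f):
    """Y^(X x B) -> (Y^B)^X : f goes to x |-> f(x, -)."""
    return {x: {b: f[(x, b)] for b in B} for x in X}

def uncurry(phi):
    """(Y^B)^X -> Y^(X x B) : phi goes to (x, b) |-> (phi x)(b)."""
    return {(x, b): phi[x][b] for x in X for b in B}

def encode(phi):
    """Hashable form of an element of (Y^B)^X, used only for counting."""
    return tuple(tuple(phi[x][b] for b in B) for x in X)

round_trip = all(uncurry(curry(f)) == f for f in all_maps(XB, Y))
images = {encode(curry(f)) for f in all_maps(XB, Y)}
count_matches = (
    len(images) == (len(Y) ** len(B)) ** len(X) == len(Y) ** (len(X) * len(B))
)
print(round_trip, count_matches)
```

The round trip shows that `curry` is injective, and the count shows that it reaches every element of the iterated mapping set.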


More surprising is that the existence of exponentiation implies a fundamental 
“equation” that does not mention exponentiation but is only concerned with the 
lower-order operations of sum and product. We refer to the distributive law, which 
is discussed in Section 7.2. 


5.4 Additional Exercises 

Exercise 5.11 

In Exercise 3.46 we saw that the linear transformations between finite-dimensional 
vector spaces V and V' have a natural commutative group operation. In fact they 
even form a vector space, often denoted L(V, V'). This is easily seen by remem- 
bering that the linear transformations may be represented by matrices (though this 
representation depends on choosing bases for V and V'). 

Show, however, that L(V, V') does not satisfy the exponential axiom. 

Hint: There is a simple dimension argument to show this. 

Exercise 5.12 

The categories S/X have mapping sets. To see how these are constructed, let 
$A \xrightarrow{\alpha} X$ and $B \xrightarrow{\beta} X$ be objects. We can write these as families of fibers: $(A_x)_{x \in X}$ 
and $(B_x)_{x \in X}$. The mapping set $\alpha^\beta$ has, as fiber over the element $x$, the mapping set 
in S denoted $A_x^{B_x}$. 

Show that $\alpha^\beta$ has the correct universal property. 

Exercise 5.13 

For any monoid M the category of M-sets (see Exercise 3.51) has mapping sets. 
Construct these from the equivariant mappings and show that the universal property 
holds. 

Hint: The construction is a little more subtle than might be supposed. However, if 
M happens to be a group, your general description of the M-set $Y^X$ for M-sets X 
and Y can be shown to be equivalent to the following simpler description: Consider 
the set of all arbitrary mappings from the underlying set of X to the underlying set 
of Y and define on it an appropriate action of M. 

( Caution : If your definition of this appropriate action does not explicitly involve 
the inverse operation of the group, you will not be able to prove the universal 
property.) 

Exercise 5.14 

The category of partially ordered sets has exponentials. If X and Y are partially 
ordered sets, the mappings from X to Y that are order-preserving have a natural 
order inherited from the order on Y. 

Show that this ordered set has the universal property of the exponential. 

Exercise 5.15 

Complete the proof of Proposition 5.9 by showing that the maps defined are inverse 
to each other as claimed. 

Exercise 5.16 

Complete the proof of Proposition 5.10 by showing that the maps defined are inverse 
to each other as claimed. 

Exercise 5.17 

Let A and B be categories. The concept of functor F from A to B is defined in 
Appendix C.1, as well as in 10.18. 

(a) Show that the assignments $\Delta(X) = (X, X)$ for any set $X$ and $\Delta(f) = (f, f)$ 
for any mapping $f$ define a functor $\Delta : S \longrightarrow S \times S$; $\Delta$ is called the diagonal 
functor. 

(b) Show that the assignments $(- \times B)(X) = X \times B$ for any set $X$ and 
$(- \times B)(f) = f \times 1_B$ for any mapping $f$ define a functor $(- \times B) : S \longrightarrow S$. 

(c) Show that the assignments $(-)^B(X) = X^B$ for any set $X$ and $(-)^B(f) = f^B$ 
for any mapping $f$ define a functor $(-)^B : S \longrightarrow S$. 

(d) Show that the assignments $(- \times -)(X, Y) = X \times Y$ for any pair of sets $(X, Y)$ 
and $(- \times -)(f, g) = f \times g$ for any pair of mappings $(f, g)$ define a functor 
$- \times - : S \times S \longrightarrow S$. 

(e) If V and W are vector spaces considered as categories (Exercise 1.31) show 
that any linear transformation between them is a functor. 

(f) If X and Y are partially ordered sets considered as categories (Exercise 1.31), 
show that a functor between them is the same thing as a monotone mapping. 

Summary of the Axioms and an Example 
of Variable Sets 


6.1 Axioms for Abstract Sets and Mappings 

We have now seen most of the axioms we will require of the category S of abstract 
sets and mappings. As we progressed, some of the earlier axioms were included 
in later axioms. For example, the existence of the one-element set is part of the 
axiom that S has finite limits. Although we did not insist on it earlier, it is also the 
case that some of the axioms are more special than others. By this we mean that 
even though they hold in S they will not generally hold in categories of variable 
or cohesive sets. Thus, we are going to review the axioms here so that they can be 
considered all at once and grouped according to their generality. 

The very first axiom, of course, is 

AXIOM: S IS A CATEGORY 

We have been emphasizing all along that the fundamental operation in a category, 
composition, is the basic tool for both describing and understanding all of the other 
properties of S. 

The next group of three axioms is satisfied by any category of sets, variable or 
constant. In fact a category satisfying them is called a topos (in the elementary 
sense), and these categories have been studied intensively since 1969. 

AXIOM: FINITE LIMITS AND COLIMITS 

S has all finite limits and colimits. 

AXIOM: EXPONENTIATION 

There is a mapping set $Y^X$ for any objects X and Y in S. 

AXIOM: REPRESENTATION OF TRUTH VALUES 

There is a truth value object $1 \xrightarrow{t} \Omega$, i.e. there is a one-one correspondence 
between parts (up to equivalence) of an object $X$ and arrows $X \longrightarrow \Omega$ mediated by 
pullback along $t$. 

Some of the consequences of these axioms have been studied already. Note that 
the truth value object $\Omega$ is not required to be $1 + 1$. Indeed, in the next section we 
will see a first example of variable sets, and it will be immediately obvious when we 
compute the truth value object there that it is not $1 + 1$. Thus, we now need to 
separate two properties that were merged in stating the truth-values axiom in Section 2.4. 

Since $\Omega$ is a truth value object, the monomapping $0 \rightarrowtail \Omega$ has a 
characteristic mapping called $\neg : \Omega \longrightarrow \Omega$. Precomposing with $1 \xrightarrow{t} \Omega$ defines another 
element of $\Omega$ called $f$ for false. 

AXIOM: S IS BOOLEAN 

$\Omega$ is the following sum: $\left\{ \begin{array}{l} t \\ f \end{array} \right. : 1 + 1 \xrightarrow{\;\cong\;} \Omega$, where the injections are $t$ and $f$. 

The special toposes which (like S) satisfy this last axiom are called Boolean 
toposes; they allow the use of classical logic (see Appendix A). 

The next axiom has not been explicitly stated until now. We explain why below, 
but it should be pointed out that there are Boolean toposes that do not satisfy it. For 
example, the category S/X is always a Boolean topos, but if X has more than one 
element, then the category does not satisfy this axiom. 

AXIOM: S IS TWO-VALUED 

$\Omega$ has exactly two elements. 

The two elements of $\Omega$ must then be $t$ and $f$. The axiom is equivalent to the 
requirement that 1 has exactly two parts. 

Exercise 6.1 

Prove that S/2 is not two-valued. ◊ 

AXIOM: THE AXIOM OF CHOICE 

Any epimorphism has a section. 

Thus, as we have already pointed out, there is a representation of the cells of a 
partition by a choice of elements of the domain of the partition. This axiom has 
a very different character from the others. For one thing it can be shown that it 
implies the Boolean axiom (see McLarty [M92], Theorem 17.9). 

The axiom of choice also implies, as a special case, that for any set $X$ the 
epimorphic part of the unique mapping $X \longrightarrow 1$ is a split epimorphism. Since this 
image is called the support of $X$, this special property is sometimes referred to by 
saying that supports split. 
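
For finite sets a section of a surjective mapping can be produced by an explicit choice, which may make the statement of the axiom more concrete. The sketch below is illustrative only; the function name `section` and the sample surjection are assumptions, and nothing in it bears on the infinite case, where the axiom has real content.

```python
def section(e, domain):
    """Choose, for each value y of e, one x in the domain with e(x) == y."""
    s = {}
    for x in domain:
        s.setdefault(e(x), x)   # keep the first representative found
    return s

# Hypothetical sample data: a surjection X -> Y.
X = [0, 1, 2, 3, 4, 5]
Y = [0, 1, 2]
e = lambda x: x % 3

s = section(e, X)
# e s is the identity on Y, so s picks one element from each cell of the partition.
is_section = all(e(s[y]) == y for y in Y)
print(s, is_section)
```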

The principle that nonempty sets have elements is false in most toposes of variable 
or cohesive sets; moreover, there are several different precise meanings to the term 
“nonempty” relative to which the principle may be true in various special cases. 
Rather than just $X$ not equal to 0, the requirement that the terminal map $X \longrightarrow 1$ 
be an epimorphism is sometimes a more reasonable expression of the idea that $X$ 
is not empty. 

Recall that we assumed in Chapter 1 that S satisfies two other important pro- 
perties: 

(i) 1 is a separator, and 

(ii) in S we have $0 \neq 1$. 

A topos that satisfies (i) and (ii) is called a well-pointed topos. It is a theorem 
[MM92] that a topos is well-pointed if and only if it is Boolean, two-valued, and 
supports split. As a result, the two properties we required for S in Chapter 1, namely 
(i) and (ii), that is that S is well-pointed, are actually consequences of the axioms we 
have already stated. Conversely, our assumption in Chapter 1 that S is well-pointed 
implies that S is Boolean and two-valued. It also implies the special case of the 
axiom of choice called “supports split”. 

There is one more axiom we will require of S. The axioms so far say nothing 
about the possibility of mathematical induction. When a starting element of a set 
and a process for forming new elements of the set (an endo-mapping) are given, a 
unique sequence determined by the starting element and the process should result. 
Mathematical practice usually makes the idealization that all such sequences are 
parameterizations by a single object N ; such an N must be “infinite”. However, the 
axioms we have so far do not guarantee the existence of such an object; we will 
consider it in Section 9.1. In fact, within any category of sets that has an infinite 
object it is possible to find a Boolean topos that does not have such an object. 

In summary then, we can say precisely what we mean by a category of abstract 
sets and arbitrary mappings. It is a topos that is two-valued with an infinite 
object and the axiom of choice (and hence is also Boolean). Experience shows that 
mathematical structures of all kinds can be modeled as diagrams in such a topos. 

We conclude this section by fulfilling the promises made within Claim 3.4. The 
first of these is the following: 

if $X \neq 0$, then $X$ has an element $1 \longrightarrow X$. 

We will show that $X \twoheadrightarrow 1$, and then the axiom of choice provides the 
element we need. Now the (split) image factorization of $X \longrightarrow 1$ as $X \twoheadrightarrow I \rightarrowtail 1$ 
determines a subobject of 1. Since S is two-valued and Boolean, there are only two 
mappings $1 \longrightarrow 1 + 1 = \Omega$. These classify the only two subobjects of 1, which thus 
must be 1 and 0. Hence, $I$ is either 1 or 0. So we will have completed the proof if 
we know that $X \longrightarrow 0$ implies $X = 0$. 

Exercise 6.2 

Show that $X \times 0 \longrightarrow 0$ is an isomorphism. 

Hint: Recall that $Y^0 = 1$. Use this to show that any arrow $X \longrightarrow 0$ has an inverse. 

◊ 

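The second promise was that every part $X \hookrightarrow Y$ has a complement 
$X' \hookrightarrow Y$. One way to see this follows from the next exercise: 


Exercise 6.3 

Let $A_0 \xrightarrow{i_0} A_0 + A_1 \xleftarrow{i_1} A_1$ be a sum and $B \xrightarrow{g} A_0 + A_1$. In the following 
diagram, with both squares defined to be pullbacks, the top row is a sum. 

\[
\begin{array}{ccccc}
B_0 & \longrightarrow & B & \longleftarrow & B_1 \\
\big\downarrow & & \big\downarrow {\scriptstyle g} & & \big\downarrow \\
A_0 & \xrightarrow{\;i_0\;} & A_0 + A_1 & \xleftarrow{\;i_1\;} & A_1
\end{array}
\]

◊ 


Now, to fulfill the second promise, we may take the sum diagram for $\Omega$ (since S 
is Boolean!) as the bottom row and use the characteristic map $Y \xrightarrow{\varphi} \Omega$ for $X$ as $g$. 
We conclude that $Y$ is the sum of $X$ and $X'$ since $X'$ is by definition the pullback 
of $\varphi$ along the false map $f$. 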


6.2 Truth Values for Two-Stage Variable Sets 

Perhaps the simplest kind of variable set (beyond the constant kind whose category 
S has been studied up to now) is the category $S^{2^{op}}$ of two-stage sets with only 
one connection between the stages. Here we will use the symbols $2 = (U \longrightarrow 1)$ 
to suggest a (previous) stage $U$, which, together with a present stage, constitutes a 
total movement 1. A set $X$ in the category $S^{2^{op}}$ of all sets undergoing this movement 
will be analyzed in terms of constant sets in S as 

\[ X_1 \xrightarrow{\;\xi_X\;} X_U \]

Here $X_1$ in S is the set of elements of $X$ that persist throughout both stages, $X_U$ in 
S is the set of elements of $X$ that persisted through the previous stage, and $\xi_X$ is 
the map describing the internal structure of $X$ by specifying for each element $x$ of 
$X_1$ the element (of $X_U$) that $x$ “was” during the previous stage $U$; the result of this 
specification is denoted by $\xi_X(x)$. A map $X \xrightarrow{f} Y$ in $S^{2^{op}}$ is analyzed as a pair $f_1$, 
$f_U$ of maps in S for which $\xi_Y f_1 = f_U \xi_X$, that is, the square below commutes: 

\[
\begin{array}{ccc}
X_1 & \xrightarrow{\;f_1\;} & Y_1 \\
{\scriptstyle \xi_X} \big\downarrow & & \big\downarrow {\scriptstyle \xi_Y} \\
X_U & \xrightarrow{\;f_U\;} & Y_U
\end{array}
\]

Exercise 6.4 

If $\xi_Y f_1 = f_U \xi_X$ and $\xi_Z g_1 = g_U \xi_Y$, then 

\[ \xi_Z (g_1 f_1) = (g_U f_U)\, \xi_X \]

◊ 


By the exercise, we get a well-defined operation of composition of maps in $S^{2^{op}}$: 

\[ (gf)_1 = g_1 f_1, \qquad (gf)_U = g_U f_U \]

The terminal set 1 of $S^{2^{op}}$ is (in its S-analysis) 

\[ 1 = (1 \longrightarrow 1) \]

since, for any $X$, there is a unique commutative square 

\[
\begin{array}{ccc}
X_1 & \longrightarrow & 1 \\
{\scriptstyle \xi_X} \big\downarrow & & \big\downarrow \\
X_U & \longrightarrow & 1
\end{array}
\]

in S. To what extent do the maps 

\[ 1 \xrightarrow{\;x\;} X \]

in $S^{2^{op}}$ represent the elements of $X$? The equation required of any map in this 
category reduces to 

\[ \xi_X x_1 = x_U \]

in this case; thus, $x$ is entirely determined by $x_1$, and conversely any element $x_1$ 
of $X_1$ in S (no condition) determines, using $\xi_X$, a unique $x_U$ making the square 
commutative, i.e. determines a unique $1 \xrightarrow{x} X$ in $S^{2^{op}}$. Thus, we may briefly say 
that the maps $1 \longrightarrow X$ “are” the elements of $X$ that live through both stages. 

How can we represent the elements of $X$ in their aspect of existence only through 
the previous stage $U$? To that end consider the variable set 

\[ U = (0 \longrightarrow 1) \]

which has no elements that persist through both the present and previous stages but 
has one element throughout the previous stage. This $U$ then is a nonzero variable 
set with no element in the narrow sense. The property that nonzero abstract sets do 
have elements, as discussed in the previous section, is seen to be already violated 
with this simple variation. Now for any $X$, the maps $U \xrightarrow{x} X$ in $S^{2^{op}}$ may be 
analyzed in S as 

\[
\begin{array}{ccc}
0 & \xrightarrow{\;x_1\;} & X_1 \\
\big\downarrow & & \big\downarrow {\scriptstyle \xi_X} \\
1 & \xrightarrow{\;x_U\;} & X_U
\end{array}
\]

but $x_1$ is unique, and the square commutes no matter what element $x_U$ of $X_U$ is 
chosen. Thus, we may say the maps $U \longrightarrow X$ “are” the elements of $X$ that existed at 
the previous stage. 
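
For finite examples the maps out of 1 and out of U can be enumerated by brute force. The sketch below assumes a two-stage set is stored as a triple (present-stage elements, previous-stage elements, connecting map as a dictionary); the function names and the sample set X are hypothetical.

```python
from itertools import product

def all_maps(dom, cod):
    """All maps dom -> cod in S, each stored as a dict."""
    return [dict(zip(dom, values)) for values in product(cod, repeat=len(dom))]

def hom(X, Y):
    """All maps X -> Y of two-stage sets: pairs (f1, fU) with xi_Y f1 = fU xi_X."""
    X1, XU, xiX = X
    Y1, YU, xiY = Y
    return [
        (f1, fU)
        for f1 in all_maps(X1, Y1)
        for fU in all_maps(XU, YU)
        if all(xiY[f1[x]] == fU[xiX[x]] for x in X1)
    ]

one = (["*"], ["*"], {"*": "*"})     # the terminal two-stage set
U = ([], ["*"], {})                  # nothing now, one element previously
# Hypothetical sample: two persistent elements that "were" the same element p.
X = (["a", "b"], ["p", "q", "r"], {"a": "p", "b": "p"})

global_elements = len(hom(one, X))     # as many as elements of X_1
previous_elements = len(hom(U, X))     # as many as elements of X_U
print(global_elements, previous_elements, len(hom(U, one)), len(hom(one, U)))
```

In this example U is not empty as a two-stage set, yet there is no map from 1 to U, in line with the remark that U has no element in the narrow sense.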

Exercise 6.5 

There is exactly one map U — s- I in o' . Moreover, given any 1 — >• X, the com- 
position U —^4* 1 X in S 2 "'' represents the x\j that the specifies as the element 
that x “was” throughout U . 

(Draw the appropriate diagram to verify the statement.) 0 

Exercise 6.6 

A map $A \xrightarrow{i} X$ in $\mathcal{S}^{2^{\mathrm{op}}}$ is a part of $X$ if and only if the following three conditions
are all satisfied.

$A_1$ is a part of $X_1$ in $\mathcal{S}$
$A_U$ is a part of $X_U$ in $\mathcal{S}$
$\xi_X i_1 = i_U \xi_A$

Moreover, $\xi_A$ is uniquely determined (if it exists) by $\xi_X$, $i_1$, $i_U$; the condition that
$\xi_A$ exists is

$$\forall x \in X_1\, [x \in A_1 \Rightarrow \xi_X x \in A_U]$$

◊
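Now we will calculate the truth-value variable set $\mathcal{P}(1) = \mathcal{P}_{2^{\mathrm{op}}}(1)$ in $\mathcal{S}^{2^{\mathrm{op}}}$ that
precisely represents all parts in that category through characteristic functions

$$\frac{X \xrightarrow{\ \varphi\ } \mathcal{P}(1)}{A \hookrightarrow X}$$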


An arbitrary part $A \hookrightarrow X$ may involve elements of $X$ that are not now in $A$ but
were previously in $A$. Yet the characteristic function $\varphi$ of $A$ must be defined for all
elements of $X$, and of course $\varphi$ must be a map in $\mathcal{S}^{2^{\mathrm{op}}}$. By taking $X = 1$ and $X = U$
above, this forces

$$\mathcal{P}(1) = \big(\{0, U, 1\} \longrightarrow \{0, 1\}\big)$$

to have three truth values globally, but only two truth values in the previous stage.
For any $A \hookrightarrow X$, one can define $\varphi$ at the “present” stage by

$$\varphi_1 x = \begin{cases}
0 & \text{if } x \notin A,\ \xi_X(x) \notin A \\
U & \text{if } x \notin A,\ \xi_X(x) \in A \\
1 & \text{if } x \in A
\end{cases}$$
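The three-way case distinction can be written out directly for finite data. This Python sketch is an illustration under assumed encodings: a part is given by its two stages `A1` and `AU` as sets, the transition map of $X$ is a dict `xi`, and the function name `chi_present` is hypothetical.

```python
def chi_present(x, A1, AU, xi):
    """Truth value (0, 'U' or 1) of a present element x relative to the part A.

    A1 and AU are the present and previous stages of the part; xi is the
    transition map of X as a dict. All names here are illustrative.
    """
    if x in A1:
        return 1       # x is in A now
    if xi[x] in AU:
        return "U"     # x is not in A now, but was in A previously
    return 0           # x is not in A now and was not previously

# Sample data: X1 = {a, b, c}, XU = {p, q}, with a part A1 = {a}, AU = {p}.
xi = {"a": "p", "b": "p", "c": "q"}
A1, AU = {"a"}, {"p"}
values = {x: chi_present(x, A1, AU, xi) for x in xi}
```

Here `b` receives the intermediate value `'U'` because it is not in the part now, although the element it “was” belongs to the part at the previous stage.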


Exercise 6.7 

Define also $\varphi_U$ and show that $\varphi_U \xi_X = \xi_{\mathcal{P}(1)} \varphi_1$. For each of the two types of elements
$x$ of $X$,

$$x \in A \iff \varphi x = 1$$

◊


6.3 Additional Exercises 

Exercise 6.8 

The categories $\mathcal{S}/X$ satisfy most of the axioms in Section 6.1, as we have seen in
Exercises 3.48, 4.46, 4.52, 5.12. Show that $\mathcal{S}/X$ has a truth-value object, namely,
the projection $2 \times X \to X$. Thus, $\mathcal{S}/X$ fails to be a category of abstract sets and
mappings only in that it is not two-valued.

Exercise 6.9 

(a) Show that there is a (diagonal) functor (see Appendix C.1) denoted $\Delta_X$ from $\mathcal{S}$
to $\mathcal{S}/X$ whose value at a set $A$ is the object of $\mathcal{S}/X$ given by the projection
mapping $A \times X \to X$. (So define $\Delta_X$ also on mappings and show that it satisfies the
equations.)

(b) Show that there is a functor $\Sigma_X$ (for sum, why?) from $\mathcal{S}/X$ to $\mathcal{S}$ that sends an
object $Y \xrightarrow{\varphi} X$ of $\mathcal{S}/X$ to the set $Y$.

These two functors have the following important relationship:

(c) Show that for any set $A$ in $\mathcal{S}$ and object $Y \xrightarrow{\varphi} X$ in $\mathcal{S}/X$ there is a one-one
correspondence between mappings in $\mathcal{S}$ from $\Sigma_X(\varphi)$ to $A$ and mappings in
$\mathcal{S}/X$ from $\varphi$ to $\Delta_X(A)$. ($\Sigma_X$ is left adjoint - see Appendix C.1 - to $\Delta_X$.)

Exercise 6.10 

Show that $\mathcal{S}^{2^{\mathrm{op}}}$ has all finite limits and colimits.

Hint: They are computed “pointwise”.

Exercise 6.11

The previous exercise and the description in Section 6.2 of $\mathcal{P}(1)$ show that $\mathcal{S}^{2^{\mathrm{op}}}$ is
not Boolean. Show that it is not two-valued either.

Exercise 6.12

(a) Show that epimorphisms in $\mathcal{S}^{2^{\mathrm{op}}}$ are pointwise. By “pointwise” here we mean
a property holding at both 1 and $U$. Thus, epimorphisms are pointwise means
that $X \xrightarrow{f} Y$ is epi in $\mathcal{S}^{2^{\mathrm{op}}}$ iff $f_1$ and $f_U$ are epi in $\mathcal{S}$.

(b) Show that supports split in $\mathcal{S}^{2^{\mathrm{op}}}$, but

(c) Show that the axiom of choice fails in $\mathcal{S}^{2^{\mathrm{op}}}$.

Hint: There is a nonsplit epimorphism between objects with no more than two
elements at each vertex.

Exercise 6.13 

Show how to construct mapping sets in $\mathcal{S}^{2^{\mathrm{op}}}$.

Hint: $(Y^X)_U = Y_U^{X_U}$ and $(Y^X)_1$ is a certain set of pairs of mappings.

Exercise 6.14

(a) Show that there is a (diagonal) functor (see Appendix C.1) denoted $\Delta$ from $\mathcal{S}$
to $\mathcal{S}^{2^{\mathrm{op}}}$ whose value at a set $A$ is the object of $\mathcal{S}^{2^{\mathrm{op}}}$ given by the identity mapping
$A \to A$. (So define $\Delta$ also on mappings and show it satisfies the equations.)

(b) Show that there is a functor dom from $\mathcal{S}^{2^{\mathrm{op}}}$ to $\mathcal{S}$ that sends an object
$X = (X_1 \xrightarrow{\xi_X} X_U)$ of $\mathcal{S}^{2^{\mathrm{op}}}$ to the set $X_1$.

(c) Show that there is a functor cod from $\mathcal{S}^{2^{\mathrm{op}}}$ to $\mathcal{S}$ that sends an object
$X = (X_1 \xrightarrow{\xi_X} X_U)$ of $\mathcal{S}^{2^{\mathrm{op}}}$ to the set $X_U$.

These three functors have the following important relationships. Show that
for any set $A$ in $\mathcal{S}$ and object $X = (X_1 \xrightarrow{\xi_X} X_U)$ of $\mathcal{S}^{2^{\mathrm{op}}}$ there are one-one
correspondences

(i) between mappings in $\mathcal{S}$ from cod$(X)$ to $A$ and mappings in $\mathcal{S}^{2^{\mathrm{op}}}$ from $X$ to
$\Delta(A)$ (cod is left adjoint - see Appendix C.1 - to $\Delta$) and

(ii) between mappings in $\mathcal{S}^{2^{\mathrm{op}}}$ from $\Delta(A)$ to $X$ and mappings in $\mathcal{S}$ from $A$ to
dom$(X)$ (dom is right adjoint to $\Delta$).

Exercise 6.15 

If $M$ is a monoid, the category of $M$-sets has finite limits and colimits as well as
mapping sets (Exercises 3.51, 4.47, 5.13), but does not satisfy the axiom of choice
(4.54). It does have a truth-value object. First, note that a subobject of an $M$-set $X$
is simply a part $A \hookrightarrow X$ of $X$ that is closed under the right action of $M$. For its
characteristic function $\varphi$ we send $x \in X$ to the set $I$ of $m \in M$ such that $xm \in A$.
This $I$ is a part of $M$ closed under right multiplication by all of $M$ (called a “right
ideal”). The set of right ideals of $M$, denoted $\Omega_M$, has a right action by $M$ given by
$I \cdot m = \{j \in M \mid mj \in I\}$, and so it is an $M$-set. It is the object we seek; $M$ itself is
an element of $\Omega_M$ that plays the role of “true”.

Show that, with the characteristic mappings just outlined, $\Omega_M$ is indeed the truth-
value object.

Thus, the category of $M$-sets is a topos.

7.1 Concrete Duality: The Behavior of Monics and Epics under
the Contravariant Functoriality of Exponentiation

Any conceivable cancellation law states in effect that some algebraic process or 
other is injective. This is most evident in the definition of the concept of monomorphism,
whereby the “injectivity” of the algebraic process of composing with $f$ on
the left turns out to be equivalent to injectivity of $f$ itself as a map. In terms of
function spaces we can express the equivalence

$$f \text{ injective} \iff f \text{ monomorphic}$$

simply by

$$X^T \xrightarrow{\ f^T\ } Y^T \quad \text{injective}$$

since we see that to say that $f^T$ is injective in its action on elements $1 \xrightarrow{\ulcorner x \urcorner} X^T$
defined on 1 is equivalent to saying that for all $x_1, x_2$

$$T \overset{x_1}{\underset{x_2}{\rightrightarrows}} X \xrightarrow{\ f\ } Y, \qquad f x_1 = f x_2 \implies x_1 = x_2$$

if we only recall that the action of $f^T$ is

$$f^T \ulcorner x \urcorner = \ulcorner f x \urcorner \quad \text{for all } T \xrightarrow{x} X$$

But what if $f$ is epimorphic? Since the statement

$$\varphi_1 f = \varphi_2 f \implies \varphi_1 = \varphi_2$$

is also a cancellation property, it also expresses that some process is injective, and
if we look a little more closely, we see that the process in question is the one
represented by the contravariant functoriality of mapping sets. For later use it is
preferable to state this fact as a property relative to a given object $V$.


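Proposition 7.1: For a given $X \xrightarrow{f} Y$ and a given $V$,

$$V^Y \xrightarrow{\ V^f\ } V^X$$

is injective (on elements) exactly when

$$\varphi_1 f = \varphi_2 f \implies \varphi_1 = \varphi_2$$

holds for any $Y \overset{\varphi_1}{\underset{\varphi_2}{\rightrightarrows}} V$.

Proof: $V^f \ulcorner \varphi \urcorner = \ulcorner \varphi f \urcorner$.

Corollary 7.2: $X \xrightarrow{f} Y$ is epimorphic if and only if

$$V^Y \xrightarrow{\ V^f\ } V^X$$

is injective for all $V$.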

It will be recalled that the definition of (for example) “epimorphism” is the formal 
dual of the definition of “monomorphism” in the sense that one simply reverses all 
arrows in the relevant diagrams; of course if the original diagrams had been given 
specific interpretation in terms of specific sets and mappings, such interpretation 
is lost when we pass to this formal dual in that the formal dualization process 
in itself does not determine specific sets and specific mappings that interpret the 
dualized statement. On the other hand, for any given V, we do have the process of 
contravariant functoriality that for any specific diagram, say 

$$X \xrightarrow{\ f\ } Y$$

produces a specific diagram (with “bigger” sets!) in which all arrows have been 
reversed 


$$V^X \xleftarrow{\ V^f\ } V^Y$$

and that satisfies (at least) all the commutativities (= statements about equality of 
compositions) satisfied by the original diagram except, of course, that the order of
the composition has been reversed. This is often referred to as “concrete duality
with respect to V” or “dualizing into V”. Not every statement will be taken into its 
formal dual by the process of dualizing with respect to V , and indeed a large part 
of the study of mathematics 


space vs. quantity 


and of logic 


theory vs. example 

may be considered as the detailed study of the extent to which formal duality and 
concrete duality into a favorite V correspond or fail to correspond. 

In the context of constant sets, the choice $V = 2$ is the starting point of many
such duality theories.


Theorem 7.3: $X \xrightarrow{f} Y$ is epimorphic if and only if $2^Y \xrightarrow{2^f} 2^X$ is monomorphic.

Proof: The “only if” direction is a special case of (the internalization of) the
definition of “epic”.

Conversely, if $2^Y \xrightarrow{2^f} 2^X$ is monomorphic, then $f$ satisfies the (restricted to 2)
right-cancellation property

$$X \xrightarrow{\ f\ } Y \overset{\varphi_1}{\underset{\varphi_2}{\rightrightarrows}} 2, \qquad \varphi_1 f = \varphi_2 f \implies \varphi_1 = \varphi_2$$

But we can take

$\varphi_1$ = characteristic function of the image of $f$

$\varphi_2$ = identically true,

which will surely satisfy $\varphi_1 f = \varphi_2 f$; if we apply the assumed cancellation property,
we see that $f$ is epimorphic (hence intuitively that the image of $f$ fills up the whole
of $Y$). ■

Because the dualization functor is not reversible without modification, the fol- 
lowing further duality property does not follow from the previous theorems. 

Theorem 7.4: $X \xrightarrow{f} Y$ is a monomorphism if and only if $2^Y \xrightarrow{2^f} 2^X$ is
epimorphic.

Proof: If $T \overset{x_1}{\underset{x_2}{\rightrightarrows}} X \xrightarrow{f} Y$ have equal composites, then by functoriality so do

$$2^Y \xrightarrow{\ 2^f\ } 2^X \overset{2^{x_1}}{\underset{2^{x_2}}{\rightrightarrows}} 2^T$$

and thus if $2^f$ is epimorphic, then $2^{x_1} = 2^{x_2}$, from which we will conclude shortly
that $x_1 = x_2$, so that $f$ is monomorphic.

Conversely, if $f$ is assumed monomorphic, we need to conclude that

$$2^Y \xrightarrow{\ 2^f\ } 2^X$$

is epimorphic, for which it suffices to show that $2^f$ is surjective. So assume $\ulcorner \varphi \urcorner$
is any element of $2^X$; we must show that there is at least one $\ulcorner \psi \urcorner$ for which
$2^f \ulcorner \psi \urcorner = \ulcorner \varphi \urcorner$, i.e. for which

$$\psi f = \varphi$$

But $\varphi$ is the characteristic function of a part $i$ of $X$, and by assumption $f$ is a part of
$Y$; thus, the composite $fi$ is a part of $Y$; we naturally guess that taking $\psi$ to be the
characteristic function of this composite part will satisfy our requirement, which
we can prove as follows:

$$\varphi x = \text{true} \iff x \in i$$
$$\psi y = \text{true} \iff y \in fi$$

$$S \overset{i}{\hookrightarrow} X \overset{f}{\hookrightarrow} Y$$

Calculate, for any $x$

$$(\psi f)x = \text{true} \iff \psi(fx) = \text{true} \iff fx \in fi \iff x \in i \iff \varphi x = \text{true}$$


Hence, $\psi f = \varphi$ as required.

We return to showing that

$$2^{x_1} = 2^{x_2} \implies x_1 = x_2$$

A useful tool is the singleton map $\{\ \}$ defined as the exponential transpose of the
characteristic function of the diagonal

$$\frac{X \times X \longrightarrow 2}{X \xrightarrow{\ \{\ \}\ } 2^X}$$

In other words if $\{x\} = \ulcorner \varphi \urcorner$ and $i$ is the part of $X$ with characteristic function $\varphi$,
then for any $x'$

$$x' \in i \iff x' = x$$

Now if we assume that $T \overset{x_1}{\underset{x_2}{\rightrightarrows}} X$ are such that in $2^X \overset{2^{x_1}}{\underset{2^{x_2}}{\rightrightarrows}} 2^T$ we have
$2^{x_1} = 2^{x_2}$, then composing with $X \xrightarrow{\{\ \}} 2^X$ we get equal maps $X \rightrightarrows 2^T$ and hence
equal maps $T \times X \to 2$ and therefore equal parts of $T \times X$. But it is easily seen
that the parts of $T \times X$ arising in this way are actually the graphs of the corresponding
maps $T \to X$; if the graphs are equal, then the corresponding maps
are equal, as was to be shown. ■
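Theorems 7.3 and 7.4 can be spot-checked on small finite sets by tabulating $2^f$ explicitly. The Python sketch below is an illustration only; the encoding of maps as dicts and the helper names (`all_maps`, `dual_into_2`, `check`) are assumptions introduced here, and a finite check is of course not a proof.

```python
from itertools import product

def all_maps(X, Y):
    """Every map X -> Y between finite lists, each represented as a dict."""
    return [dict(zip(X, values)) for values in product(Y, repeat=len(X))]

def dual_into_2(f, X, Y):
    """Tabulate 2^f : 2^Y -> 2^X, sending psi to the composite psi . f."""
    table = {}
    for psi in all_maps(Y, [False, True]):
        table[tuple(psi[y] for y in Y)] = tuple(psi[f[x]] for x in X)
    return table

def is_injective(m):
    """A finite map (dict) is injective when no two keys share a value."""
    return len(set(m.values())) == len(m)

def is_surjective(m, codomain_size):
    """A finite map (dict) is surjective when it reaches every codomain element."""
    return len(set(m.values())) == codomain_size

def check(X, Y):
    """Compare each f : X -> Y with its dual 2^f, as in Theorems 7.3 and 7.4."""
    for f in all_maps(X, Y):
        two_f = dual_into_2(f, X, Y)
        f_epi = set(f.values()) == set(Y)
        f_mono = is_injective(f)
        if f_epi != is_injective(two_f):                   # Theorem 7.3
            return False
        if f_mono != is_surjective(two_f, 2 ** len(X)):    # Theorem 7.4
            return False
    return True
```

For instance, `check([0, 1], [0, 1, 2])` runs through all nine maps from a two-element set to a three-element set and compares each with its dual.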


Theorem 7.4 about 2 in the category of constant sets becomes a definition of a 
special kind of object V in more general categories. 


Definition 7.5: An object $V$ is an injective object if for every monomorphism
$X \overset{f}{\hookrightarrow} Y$ and every $X \xrightarrow{\varphi} V$ there exists $Y \xrightarrow{\bar\varphi} V$ such that

$$\bar\varphi f = \varphi$$

In case the category has exponentiation, $V$ is injective if and only if for every
monomorphism $X \xrightarrow{f} Y$, $V^Y \xrightarrow{V^f} V^X$ is surjective (on elements $1 \to V^X$). It is
not surprising that most of the concrete dualities in mathematics and logic that work
well involve dualizing into an injective object $V$.

Another twist on the same underlying problem is the following: Let $V$ be any
given object and define $X \xrightarrow{f} Y$ to be co-surjective relative to $V$ to mean

$$\forall \varphi\ \exists \bar\varphi\ [\bar\varphi f = \varphi]$$

By abstract duality plus justifiable prejudice we might expect co-surjectivity to be
similar to injectivity for a map. To make this precise, consider one more definition:

For $T \overset{x_1}{\underset{x_2}{\rightrightarrows}} X$, say that $x_1 \equiv_V x_2$ if and only if

$$\forall\, X \xrightarrow{\varphi} V\ [\varphi x_1 = \varphi x_2]$$

This $\equiv_V$ may be read as “congruent modulo $V$” or more simply as “equal insofar
as $V$-valued properties can distinguish”.


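Proposition 7.6: If $f$ is co-surjective relative to $V$, then $f$ is “monic modulo $V$,”
i.e.,

$$f x_1 \equiv_V f x_2 \implies x_1 \equiv_V x_2$$

Proof: The hypothesis of the implication refers of course to testing relative to all
$Y \to V$. We must show $\varphi x_1 = \varphi x_2$ for any given $X \xrightarrow{\varphi} V$. But by co-surjectivity
$\varphi$ can be extended to a $\bar\varphi$

$$\begin{aligned}
\varphi x_1 &= (\bar\varphi f) x_1 \\
&= \bar\varphi (f x_1) \\
&= \bar\varphi (f x_2) \quad \text{by the hypothesis } f x_1 \equiv_V f x_2 \\
&= (\bar\varphi f) x_2 \\
&= \varphi x_2
\end{aligned}$$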


Without yet going into detail concerning the modifications necessary to reverse
the concrete duality, we note that double dualization $V^{(V^X)}$ depends only on the
dual $V^X$ and is a covariant functor in that $X \xrightarrow{f} Y$ induces

$$V^{(V^X)} \xrightarrow{\ V^{(V^f)}\ } V^{(V^Y)}$$

in the same direction. Moreover, there is for each $X$ a map

$$X \xrightarrow{\ \hat{(\ )}\ } V^{(V^X)}$$

defined by

$$x \longmapsto [\varphi \mapsto \varphi x]$$

and often written (ignoring the distinction between a map and the corresponding
element of a mapping set) as $\hat{x}(\varphi) = \varphi(x)$. The map $\hat{(\ )}$, sometimes called the
Fourier transform or the Dirac delta, is natural in the sense that for any map
$X \xrightarrow{f} Y$ the diagram

$$\begin{array}{ccc}
X & \longrightarrow & V^{(V^X)} \\
{\scriptstyle f}\downarrow & & \downarrow{\scriptstyle V^{(V^f)}} \\
Y & \longrightarrow & V^{(V^Y)}
\end{array}$$

is commutative. In case $V$ is a coseparator, the map $\hat{(\ )}$ is monic and thus $X$ is
(the domain of) a part of the double dual of $X$. The fact that this double dual
is much larger than $X$ can often be overcome, and $X$ can actually be recovered
from the knowledge of its dual $A = V^X$ by noting that the “variable quantities”
$A$ and the “constant quantities” $V$ have some algebraic structure in common and
considering only the part $A^* = \mathrm{Hom}(A, V) \hookrightarrow V^A$ whose members correspond
to those maps $A \to V$ that preserve this algebraic structure; the thus-tempered

$$X \longrightarrow (V^X)^*$$

has a better chance of being invertible, as we shall see in Section 8.4.


7.2 The Distributive Law 

The distributive law states that a natural map

$$A \times X_1 + A \times X_2 \longrightarrow A \times (X_1 + X_2)$$

is actually invertible. Here the $\times$ and $+$ denote product and coproduct (= sum
in the case of sets), and the natural map exists whenever product and coproduct
exist. In fact the natural map in question comes from a special case of the following
observation: to define a map $? \to A \times S$ whose codomain is a product is
equivalent to defining maps from $?$ to each of the factors, whereas to define a map
$P_1 + P_2 \to\ ??$ whose domain is a coproduct is equivalent to defining maps from
each of the summands to $??$; hence, in case we have both that the codomain is known
to be a product and that the domain is known to be a coproduct, then defining a map
is equivalent to defining a “matrix” of (smaller) maps specifying each component
of the value of the big map for each kind (summand) of input. For example, any map

$$P_1 + P_2 \longrightarrow A \times S$$

(between the indicated combinations of any four given sets) is specified by a
$2 \times 2$ matrix whose entries would be the four possible threefold composites with
projections and injections.



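In the case at hand, where $S = X_1 + X_2$, $P_k = A \times X_k$, we have enough structure
to actually specify such a matrix, where

$P_1 \to A$ is $A \times X_1 \xrightarrow{\text{proj}} A$
$P_2 \to A$ is $A \times X_2 \xrightarrow{\text{proj}} A$
$P_1 \to S$ is $A \times X_1 \xrightarrow{\text{proj}} X_1 \xrightarrow{i_1} X_1 + X_2$
$P_2 \to S$ is $A \times X_2 \xrightarrow{\text{proj}} X_2 \xrightarrow{i_2} X_1 + X_2$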

Of course, the natural map just described in detail is the one corresponding to 
the intuitive picture of a product as a rectangular arrangement and of a coproduct 
of sets as a disjoint union: 



[Figure: rectangular picture with regions labeled $A \times X_1$ and $A \times X_2$, and sides labeled $A$ and $X_1 + X_2$.]


However, although the “rectangular arrangement” picture of the elements of a
product is justified by the defining property of products, the “disjoint sum” picture
of the elements of a coproduct is NOT justified by the defining property in itself
of coproducts. This is because the defining property of a coproduct $X_1 + X_2 = S$
refers to co-elements of $S$,

$$S \longrightarrow V$$

and says nothing in itself about how to determine the elements

$$T \longrightarrow S$$

that $S$ might have. Indeed, in other important categories the coproduct may be quite
unlike a disjoint sum; for example, in the basic category of linear algebra, namely
the category of vector spaces and linear transformations (also called linear spaces
and linear maps), the coproduct is the same space as the product (see Exercise 3.46),
and hence neither the distributive law nor “disjointness” could hold. Thus, the
invertibility of the natural map $A \times X_1 + A \times X_2 \to A \times (X_1 + X_2)$ in the context
of nonlinear spaces and nonlinear maps (in particular for abstract sets and arbitrary
maps) must be due (like the disjointness of +) to some further feature of these
categories beyond the mere existence of product and coproduct in themselves. The
existence of exponentiation turns out to supply this feature. For to show that the
natural map has an inverse we must construct a candidate

$$A \times (X_1 + X_2) \longrightarrow A \times X_1 + A \times X_2$$

and then verify that the candidate really is inverse; the construction is the step that
requires some additional ingredient. But note that the required candidate should
be a map whose domain is a product and that since the crucial feature of the
exponentiation axiom is an invertible process, we could just as well view that
axiom as a prescription for constructing lower-order maps $A \times S \to Y$ by instead
constructing higher-order maps $S \to Y^A$. This apparently perverse interpretation
of the exponentiation axiom turns out to be exactly what is needed in this case (and
in many other cases) since it enables us to gain access to certain special structures
that $S$ has. That is, the proof of the distributive law can be achieved by constructing
an inverse map

$$A \times (X_1 + X_2) \longrightarrow A \times X_1 + A \times X_2$$

as we have seen. But by the exponentiation axiom, we can equivalently construct

$$X_1 + X_2 \longrightarrow (A \times X_1 + A \times X_2)^A$$

which (by the special coproduct structure of $S = X_1 + X_2$) is equivalent to the
problem of constructing two maps (for $k = 1, 2$)

$$X_k \longrightarrow (A \times X_1 + A \times X_2)^A$$

By applying exponentiation again, but in the opposite direction, this is equivalent
to constructing two maps

$$A \times X_k \longrightarrow A \times X_1 + A \times X_2$$

But such maps are staring us in the face - the two injections! Hence, we can construct
the required candidate map

$$A \times (X_1 + X_2) \longrightarrow A \times X_1 + A \times X_2$$

which can be shown to behave as expected, 

$$\langle a, i_k x \rangle \longmapsto i_k \langle a, x \rangle$$

on elements (when 1 — X 2 ). 
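On finite sets the natural map and the candidate inverse are easy to write down once a coproduct is modeled as a set of tagged elements. The Python sketch below assumes that encoding (an element of $X_1 + X_2$ is a pair `(k, x)` with tag `k`); the function names are hypothetical and chosen only for this illustration.

```python
def natural_map(tagged):
    """A x X1 + A x X2 -> A x (X1 + X2): send (k, (a, x)) to (a, (k, x))."""
    k, (a, x) = tagged
    return (a, (k, x))

def candidate_inverse(pair):
    """A x (X1 + X2) -> A x X1 + A x X2: send (a, (k, x)) to (k, (a, x))."""
    a, (k, x) = pair
    return (k, (a, x))

# Sample finite sets (illustrative data only).
A, X1, X2 = ["a", "b"], [0, 1], ["u"]

# Elements of A x X1 + A x X2, tagged by the summand they come from.
left = [(1, (a, x)) for a in A for x in X1] + [(2, (a, x)) for a in A for x in X2]

# Elements of A x (X1 + X2).
right = [(a, (1, x)) for a in A for x in X1] + [(a, (2, x)) for a in A for x in X2]

round_trip_left = all(candidate_inverse(natural_map(t)) == t for t in left)
round_trip_right = all(natural_map(candidate_inverse(p)) == p for p in right)
```

In this encoding the candidate map moves the tag `k` from the second component to the outside, which is the behavior on elements displayed above.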

Exercise 7.7 

Verify that the candidate map just constructed really is the two-sided inverse of the
natural map

$$A \times X_1 + A \times X_2 \longrightarrow A \times (X_1 + X_2)$$

◊


Exercise 7.8 

Prove (using exponentiation) that 


Exercise 7.9 

Prove (using exponentiation) that if $X \xrightarrow{f} Y$ is an epimorphic map, then

$$A \times X \xrightarrow{\ 1_A \times f\ } A \times Y$$

is also epimorphic. ◊


7.3 Cantor’s Diagonal Argument 

Over a century ago Georg Cantor proved an important theorem that includes the 
result 

$$X < 2^X$$

for all sets X (see Def. 7.15 below). This result, well-known for finite sets X, was 
quite new for infinite sets since it showed that some infinities are definitely bigger 
than others (even though many constructions on infinite sets X tend to give sets of 
the same cardinality, that is,

$$2 \times X \cong X, \qquad X^2 \cong X$$

hold for infinite abstract sets). Indeed, this specific and fundamental construction
leads to a potentially infinite sequence of larger and larger infinities

$$X < 2^X < 2^{2^X} < 2^{2^{2^X}} < \cdots$$

An even more fundamental consequence of Cantor’s theorem is the obvious conclusion
that there cannot exist a “universal set” $V$ for which every set $X$ appears
as the domain of a part of $V$

$$X \hookrightarrow V$$

because if there were such a $V$, we could take $X = 2^V$ to reach a contradiction since
(see Theorem 7.4) the restriction map

$$2^V \twoheadrightarrow 2^X$$

is surjective, but Cantor showed that no map $X \to 2^X$ is surjective.

Cantor’s method for proving this theorem is often called the “diagonal argument” 
even though the diagonal map $\delta_X$ is only one of two equally necessary pillars on
which the argument stands, the second being a fixed-point-free self-map $\tau$ (such as
logical negation in the case of the set 2). This diagonal argument has been traced (by
philosophers) back to ancient philosophers who used something like it to mystify 
people with the Liar’s paradox. Cantor, however, used his method to prove positive 
results, namely inequalities between cardinalities. The philosopher Bertrand 
Russell, who was familiar with Cantor’s theorem, applied it to demonstrate the 
inconsistency of a system of logic proposed by the philosopher Frege; since then 
philosophers have referred to Cantor’s theorem as Russell’s paradox and have even 
used their relapse into the ancient paradox habit as a reason for their otherwise 
unfounded rumor that Cantor’s set theory might be inconsistent. (Combatting this 
rumor became one of the main preoccupations of the developers of the axiomatized 
set theories of Zermelo, Fraenkel, von Neumann, and Bernays [Sup72]. This pre- 
occupation assumed such an importance that the use of such axiom systems for 
clarifying the role of abstract sets as a guide to mathematical subjects such as 
geometry, analysis, combinatorial topology, etc., fell into neglect for many years.) 

Around 1930 both Gödel and Tarski again used exactly the same diagonal argument
of Cantor, except in categories of a more linguistic nature than the category
of sets, to obtain their famous results to the effect that (Gödel) for any proposed
axiom system for number theory there will always be further truths about the number
system that do not follow as theorems in that axiom system, and (Tarski) even
though it is easy to construe mathematical statements $\varphi$ as mathematical entities
$\ulcorner \varphi \urcorner$, there can be no definition of a single “truth predicate” $T$ such that, for any
statement $\varphi$ and every specific mathematical entity $x$,

$$\ulcorner \varphi \urcorner\, T\, x \iff \varphi(x)$$

is a mathematical theorem.

We will first prove the theorem in a still more positive form as a fixed point 
theorem, of which Cantor’s theorem will be the contrapositive. Cantor himself 
proved cardinality inequalities not only for the set $2^X$ of subsets but also for the
set $\mathbb{R}^X$ of real-valued functions, and correspondingly our fixed-point theorem will
deal with objects $Y$ more general than $Y = 2$.

Definition 7.10: A self-map $Y \xrightarrow{\tau} Y$ of an object $Y$ is said to have $1 \xrightarrow{y} Y$ as a
fixed point if and only if $\tau y = y$. Thus, $\tau$ is said to be fixed-point-free if and only if
$\forall\, 1 \xrightarrow{y} Y\ [\tau y \neq y]$. At the other extreme, an object $Y$ is said to have the fixed-point
property if and only if every self-map $\tau$ has at least one fixed point $y$.

Remark 7.11: Although the fixed-point property is so rare as to be uninteresting
for the category of abstract constant sets and arbitrary maps, it is much more frequent
and useful in the category of continuous spaces and maps, where Brouwer proved
that the $n$-dimensional ball has the fixed-point property; for $n = 1$, this fact implies
existential statements such as Rolle’s theorem, which involves continuous maps of
the interval $[-1, 1]$ into itself.

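Theorem 7.12: Suppose there is an $X$ and a map $\varphi$

$$X \times X \xrightarrow{\ \varphi\ } Y$$

such that for every $X \xrightarrow{f} Y$ there is at least one $1 \xrightarrow{a} X$ such that

$$f = \varphi(a, -)$$

Then $Y$ has the fixed-point property.

Proof: Consider any $Y \xrightarrow{\tau} Y$. We must show that $\tau$ has a fixed point. Define an $f$
by the triple composite

$$X \xrightarrow{\ \delta_X\ } X \times X \xrightarrow{\ \varphi\ } Y \xrightarrow{\ \tau\ } Y$$

In other words, for all $x$

$$f x = \tau\varphi(x, x)$$

Now by the assumed property of $\varphi$, this $f$ must be “$\varphi$-represented” by some $a$:

$$f x = \varphi(a, x)$$

for all $x$. Hence,

$$\tau\varphi(x, x) = \varphi(a, x)$$

for all $x$. In particular, if we take $x = a$,

$$\tau\varphi(a, a) = \varphi(a, a)$$

which means that if we define $y$ in terms of $a$ by commutativity of

$$1 \xrightarrow{\ a\ } X \xrightarrow{\ \delta_X\ } X \times X \xrightarrow{\ \varphi\ } Y, \qquad y = \varphi(a, a)$$

then $\tau y = y$; that is, $y$ is a fixed point of $\tau$. ■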

Corollary 7.13: (Cantor) If $Y$ has at least one self-map $\tau$ with no fixed points, then
for every $X$ and for every

$$X \xrightarrow{\ \Phi\ } Y^X$$

$\Phi$ is not surjective.

Proof: Surjectivity of $\Phi$, expressed by

$$\forall\, 1 \xrightarrow{\ulcorner f \urcorner} Y^X\ \ \exists\, 1 \xrightarrow{a} X\ \ [\Phi a = \ulcorner f \urcorner]$$

is exactly the property assumed in the theorem for the corresponding $\varphi$ for which
$\Phi = \ulcorner \varphi \urcorner$. The corollary is the contrapositive of the theorem, and any statement
implies its contrapositive. ■

Corollary 7.14: There is no surjective map

$$X \twoheadrightarrow 2^X$$

nor is there any surjective map

$$X \twoheadrightarrow \mathbb{R}^X$$

where $\mathbb{R}$ is the set of real numbers.

Proof: Logical negation $2 \xrightarrow{\tau} 2$ is fixed-point-free. Indeed, $\tau(\text{false}) \neq \text{false}$, and
$\tau(\text{true}) \neq \text{true}$. For the second statement, all we need to know about $\mathbb{R}$ is that it
has a self-map $\tau$ such as

$$\tau(x) = x + 1$$

for which $\tau(x) \neq x$ for all $x$. ■
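The proof of Theorem 7.12 is constructive enough to run on finite data: from any attempted $\Phi : X \to Y^X$ and a fixed-point-free $\tau$ it produces an $f$ that $\Phi$ misses. The Python sketch below illustrates this for $Y = 2$; the dict encoding of maps and the names `diagonal_witness`, `negation` and `every_attempt_misses` are assumptions made for this illustration.

```python
from itertools import product

def diagonal_witness(X, Phi, tau):
    """Return f : X -> Y with f(x) = tau(Phi(x)(x)), as a dict."""
    return {x: tau(Phi[x][x]) for x in X}

def negation(v):
    """A fixed-point-free self-map of 2 = {False, True}."""
    return not v

def every_attempt_misses(X):
    """Exhaustively check, for a small finite X, that each Phi : X -> 2^X
    fails to reach its own diagonal witness."""
    tables = [dict(zip(X, v)) for v in product([False, True], repeat=len(X))]
    for choice in product(tables, repeat=len(X)):
        Phi = dict(zip(X, choice))
        f = diagonal_witness(X, Phi, negation)
        if any(Phi[a] == f for a in X):
            return False
    return True

X = [0, 1, 2]
# A hypothetical attempt Phi : X -> 2^X, with Phi[a] the table of "x <= a".
Phi = {a: {x: x <= a for x in X} for a in X}
f = diagonal_witness(X, Phi, negation)
missed = all(Phi[a] != f for a in X)
```

The witness `f` differs from `Phi[a]` at the argument `a` itself, which is exactly the step $x = a$ in the proof.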


Definition 7.15: For sets $X$ and $Y$ we write $X \leq Y$ when there exists at least one
monomapping from $X$ to $Y$. We write $X < Y$ to mean $X \leq Y$ and that moreover no
surjective maps $X \to Y$ exist.

Corollary 7.16:

$$X < 2^X \quad \text{for all } X$$

Indeed

$$X < Y^X \quad \text{for any } Y \text{ with } 2 \leq Y$$


Another frequently cited application of Cantor’s argument shows that there are
strictly more real numbers than rational numbers. This can be established in three
steps as follows:

$$\mathbb{Q} \cong \mathbb{N} < \{0, 1, 2, 3, 4, 5, 6, 7, 8\}^{\mathbb{N}} \leq \mathbb{R}$$

That is, we separately establish (by a snakelike counting of fractions) that the set of
rational numbers is isomorphic as an abstract set with the set of natural numbers,
and we note that among the reals there are those (nonnegative ones) whose decimal
expansion involves no 9’s. The latter set is equivalent to the function space $Y^X$
indicated with a sequence $\mathbb{N} \xrightarrow{a} \{0, 1, \ldots, 8\}$ mapping to the real with decimal
expansion

$$a_0 . a_1 a_2 a_3 \ldots = \sum_{n=0}^{\infty} a_n 10^{-n}$$

Thus, we can apply our argument above with $X = \mathbb{N}$, $Y = \{0, \ldots, 8\}$, noting that
the finite set $Y$ does indeed have endomaps that move every element.


7.4 Additional Exercises 

Exercise 7.17 

Show that the truth-value object $\Omega$ is an injective object in any topos. More generally
show that the mapping set $\Omega^X$ is an injective object for any object $X$.

Exercise 7.18

Find all of the injective objects in the categories $\mathcal{S}/X$.

Hint: Start with the case $X = 1$ (i.e., the category of abstract sets and mappings).

Exercise 7.19

Find all of the injective objects in the category $\mathcal{S}^{2^{\mathrm{op}}}$.

Exercise 7.20

The distributive law holds in any topos. Verify this explicitly for $\mathcal{S}^{2^{\mathrm{op}}}$.

Exercise 7.21 

Categories with finite sums and products (including 0 and 1!) in which the distributive
law holds are extremely important in theoretical computer science. For
example (see Walters [Wal91]), let $f, g : A \to B$ and suppose $\varphi : A \to 2$ is a
“test function”. Show that the triple composite

$$A \xrightarrow{\ \langle 1_A, \varphi \rangle\ } A \times 2 \cong A + A \xrightarrow{\ \binom{f}{g}\ } B$$

can be interpreted as “if $\varphi$ then $f$ else $g$”. Indeed, if we call the composite
$h : A \to B$, then

$$h(a) = \begin{cases} f(a) & \text{if } \varphi(a) = \text{true} \\ g(a) & \text{otherwise} \end{cases}$$
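The triple composite can be spelled out step by step in code. In the Python sketch below the sum $A + A$ is modeled by tagged pairs, and the convention that “true” goes to the first summand is an assumption of this illustration; all function names are hypothetical.

```python
def pair_with_test(phi):
    """<1_A, phi> : A -> A x 2."""
    return lambda a: (a, phi(a))

def distribute(pair):
    """A x 2 -> A + A: (a, True) goes to summand 0, (a, False) to summand 1."""
    a, t = pair
    return (0, a) if t else (1, a)

def copair(f, g):
    """The map A + A -> B that applies f on summand 0 and g on summand 1."""
    return lambda tagged: f(tagged[1]) if tagged[0] == 0 else g(tagged[1])

def if_then_else(phi, f, g):
    """The triple composite A -> A x 2 -> A + A -> B."""
    first, last = pair_with_test(phi), copair(f, g)
    return lambda a: last(distribute(first(a)))

# Sample data: test for evenness, halve if even, else apply 3n + 1.
h = if_then_else(lambda n: n % 2 == 0, lambda n: n // 2, lambda n: 3 * n + 1)
```

With this sample data, `h` agrees with the ordinary conditional expression `n // 2 if n % 2 == 0 else 3 * n + 1` on every integer input.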
Exercise 7.22

The concept of natural transformation $\tau$ from a functor $F$ to a functor $G$ is defined
in Appendix C.1.

(a) Show that there is a natural transformation $\alpha$ from the functor dom : $\mathcal{S}^{2^{\mathrm{op}}} \to \mathcal{S}$
to the functor cod : $\mathcal{S}^{2^{\mathrm{op}}} \to \mathcal{S}$ (see Exercise 6.14) whose component for an
object $X$ of $\mathcal{S}^{2^{\mathrm{op}}}$ is the mapping $X_1 \xrightarrow{\xi_X} X_U$.

(b) Show that for any set $B$ there is a natural transformation $\beta$ from the functor
$(-) \times B : \mathcal{S} \to \mathcal{S}$ (see Exercise 5.17) to the identity functor $1_{\mathcal{S}} : \mathcal{S} \to \mathcal{S}$.

Hint: The components are projections.

(c) Functors can be composed. For example, the composite of the functors
$\Delta : \mathcal{S} \to \mathcal{S} \times \mathcal{S}$ and $(-) \times (-) : \mathcal{S} \times \mathcal{S} \to \mathcal{S}$ (see Exercise 5.17) is the functor
$((-) \times (-))\Delta : \mathcal{S} \to \mathcal{S}$ whose value at a set $X$ is $X \times X$. Show that the product
projections provide the components of two natural transformations from the
functor $((-) \times (-))\Delta$ to the identity functor $1_{\mathcal{S}} : \mathcal{S} \to \mathcal{S}$.

8.1 Images

We have been introduced to the contravariant functoriality of $2^X$, which should be
understood both as

(1) a special case of the composition-induced contravariant functoriality ($X \xrightarrow{f} Y$
induces $V^Y \xrightarrow{V^f} V^X$) of $V$-valued function spaces (5.5), and

(2) the operation of inverse image on parts

$$\begin{array}{ccc}
f^{-1}B & \longrightarrow & B \\
\downarrow & & \downarrow \\
X & \xrightarrow{\ f\ } & Y
\end{array}$$

internalized using the special property that $V = 2$ has of encoding parts via
characteristic functions (2.30, 2.34).

The covariant functor $2^{2^X}$ (obtained by composing the contravariant $2^{(\ )}$ with
itself) can be interpreted to consist of all “classes” of parts of $X$, since a map
$2^X \to 2$ is the characteristic function of a definite part (or class) of the set $2^X$ of
parts of $X$.

But we now want to consider an important way of making the smaller $2^X$ a
covariant functor of $X$.

First let us agree to use a somewhat more convenient notation for parts. A part $A$
of $X$ consists of two components:

(1) the underlying set $|A|$ of the part, which is the domain of the other component,
and

(2) $i_A$, which is the monomorphic given inclusion of $A$ into $X$.

To say that $A$ and $B$ are equivalent parts of $X$ means that there is

$$\begin{array}{ccc}
|A| & \xrightarrow{\ h\ } & |B| \\
& {\scriptstyle i_A}\searrow \ \ \swarrow{\scriptstyle i_B} & \\
& X &
\end{array}$$

with $h$ invertible and with the equation

$$i_B h = i_A$$

satisfied (which is denoted $A \equiv_X B$, see Definition 2.24). By contrast $|A| \cong |B|$
means merely that $|A|$, $|B|$ have the same number of elements but not necessarily
in a way that respects the inclusions. We commonly say that “$A$ is a three-element
part of $X$” to mean that $A$ is a part of $X$ such that $|A|$ has three elements; two
given three-element parts of $X$ could have no elements in common, could overlap
nontrivially, or even could have the same elements of $X$ as members, but only in
the last case would they be isomorphic as parts of $X$. Once this much is clearly
understood, one then usually follows the “abuse of notation,” which drops the $|\ |$
sign and just uses the same symbol $A$ to stand both for the part (which involves a
given $i_A$ as understood) and for the domain of the part; one then sees from the context
whether morphisms of parts (i.e., inclusion relation) or maps of the underlying sets
are being discussed. Recall that our notion of membership is: for $T \xrightarrow{x} X$,

$$x \in A \ \text{ if and only if } \ \exists a\, [x = i_A a]$$

For any $X \xrightarrow{f} Y$ we will construct an induced map

$$\mathcal{P}X \overset{\text{def}}{=} 2^X \xrightarrow{\ \mathcal{P}f\ } 2^Y \overset{\text{def}}{=} \mathcal{P}Y$$

This operation arises so frequently that it has at least four other notations

$$\mathcal{P}f = f_! = f[\ ] = \exists_f = \mathrm{im}_f$$

where the $f[\ ]$ suggests that it is a generalization to parts of the evaluation of $f$ at
elements, the $\exists_f$ suggests its intimate connection with the “there exists” operation
of logic, and the last $\mathrm{im}_f$ indicates that it is the internalization of the “geometric”
operation of taking the direct image of a part of $X$. The latter operation, for given
$X \xrightarrow{f} Y$, is as follows: Given any part $A$ of $X$, factorize the composite $f i_A$ into
surjective and injective; the resulting part of $Y$ is called $f[A]$. (See the picture
following Exercise 8.3.)

$$\begin{array}{ccc}
|A| & \xrightarrow{\ \text{surjective}\ } & |f[A]| \\
{\scriptstyle i_A}\downarrow & & \downarrow{\scriptstyle \text{injective}} \\
X & \xrightarrow{\ f\ } & Y
\end{array}$$

The following exercise implies in particular that any two such factorizations will
give equivalent parts of $Y$.

Exercise 8.1 

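(“diagonal fill-in”)

$$\begin{array}{ccc}
\bullet & \xrightarrow{\ p_2\ } & \bullet \\
{\scriptstyle p_1}\downarrow & {\scriptstyle \exists! h}\nearrow & \downarrow{\scriptstyle j_2} \\
\bullet & \xrightarrow{\ j_1\ } & Y
\end{array}$$

If $j_2$ is injective, $p_1$ surjective, and $j_1 p_1 = j_2 p_2$, then there exists an $h$ (necessarily
unique) for which both

$$p_2 = h p_1$$
$$j_1 = j_2 h$$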

Since equivalent parts correspond to equal characteristic functions, we have a map

$$2^X \xrightarrow{\ \mathrm{im}_f\ } 2^Y$$

at least if we rely on the principle that any well-defined process determines a map.
Such reliance can be avoided by a method (similar to that used before) to deduce
the existence of $\mathrm{im}_f$ as a map from a few previously assumed specific axioms, as
follows:

Exercise 8.2 

Let $E_X$ denote the part of $X \times 2^X$ whose characteristic function is the evaluation
map $X \times 2^X \to 2$. Show that the image $I_f$ of $E_X$ along the map $f \times 1_{2^X}$

$$\begin{array}{ccc}
E_X & \longrightarrow & I_f \\
\downarrow & & \downarrow \\
X \times 2^X & \xrightarrow{\ f \times 1_{2^X}\ } & Y \times 2^X
\end{array}$$

is a part of $Y \times 2^X$, whose characteristic function $Y \times 2^X \to 2$ has the desired

$$2^X \xrightarrow{\ \mathrm{im}_f\ } 2^Y$$

as its exponential transpose; i.e.

$$\mathrm{im}_f \ulcorner A \urcorner = \ulcorner f[A] \urcorner$$

where the further “abuse” of notation

$$\ulcorner A \urcorner = \ulcorner \text{characteristic function of } A \urcorner$$

has been used. ◊

The connection of direct image with existential quantification is laid bare by the 
following exercise: 

Exercise 8.3 

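If in

[Diagram: a commutative square with top $p : |A| \to |B|$, left $i_A : |A| \to X$, right $i_B : |B| \to Y$, and bottom $f : X \to Y$.]

$p$ is surjective; $A$, $B$ are parts of $X$, $Y$, respectively; and $f i_A = i_B p$, then for
any $y$

$$y \in B \iff \exists x[fx = y \text{ and } x \in A]$$

[Picture: a part $A$ of $X$ and its direct image in $Y$ under $f$.]

◊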

What is called in deductive logic the “rule of inference for existential quantifier
introduction and elimination” corresponds precisely to a geometric property relating
inverse image and direct image. You can prove this rule in Exercise 8.5 below, but
first note that

$$2^f(B) = Bf$$

(The notation $Bf$ is especially apt if one thinks of $B$ here as the characteristic
function manifestation of the part.) If $A$ is any part of $X$ and $B$ is any part of $Y$,
there may or may not exist a map $h$ for which

[Diagram: a square with top $h : |A| \to |B|$, left $i_A : |A| \to X$, right $i_B : |B| \to Y$, and bottom $f : X \to Y$.]

$$f i_A = i_B h$$

(it would be unique since $i_B$ is monic); if such a map does exist, we might write as
a further definition

$$A \subseteq_f B$$


Exercise 8.4 

$$A \subseteq_f B \iff \forall x[x \in A \Rightarrow fx \in B]$$ ◊

Exercise 8.5 

Show that 


$$f[A] \subseteq_Y B \iff A \subseteq_X Bf$$

by demonstrating that both are equivalent to $A \subseteq_f B$, that is,

if for any $x$, $A(x)$ implies $B(fx)$
then for any $y$, $\exists x[fx = y \text{ and } A(x)]$ implies $B(y)$

and conversely. (Here $A(x)$ means $x \in A$; i.e., we are using the same symbol $A$ to
denote both a part and its characteristic function, which is an abuse of notation that
should cause no confusion since we now understand the difference.) ◊
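
The equivalence between direct and inverse image can be checked by brute force on small finite sets. The following is only a sketch under stated assumptions: parts are modeled as Python frozensets, a map $f$ as a dict, and the names `parts`, `direct_image`, `inverse_image`, and `adjunction_holds` as well as the sample data are illustrative choices, not notation from the text.

```python
from itertools import chain, combinations

def parts(xs):
    """All parts (subsets) of a finite set, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(xs, r) for r in range(len(xs) + 1))]

def direct_image(f, A):
    """f[A]: all y for which there is an x in A with f(x) = y."""
    return frozenset(f[x] for x in A)

def inverse_image(f, B):
    """Bf: all x in the domain of f with f(x) in B."""
    return frozenset(x for x in f if f[x] in B)

def adjunction_holds(f, X, Y):
    """Check f[A] <= B  iff  A <= Bf for every part A of X and B of Y."""
    return all((direct_image(f, A) <= B) == (A <= inverse_image(f, B))
               for A in parts(X) for B in parts(Y))

# Hypothetical example data: a map f : X -> Y written as a dict.
X = {0, 1, 2}
Y = {"a", "b"}
f = {0: "a", 1: "a", 2: "b"}
```

Calling `adjunction_holds(f, X, Y)` runs through all 8 parts of `X` and all 4 parts of `Y`; a finite check of this kind illustrates the statement but of course does not replace the proof asked for in the exercise.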

Exercise 8.6 

Interpret the foregoing exercise in the case $X = Y \times T$, $f$ = projection to $Y$. ◊

8.2 The Covariant Power Set Functor

The covariant power set functor $\mathcal{P}$ is by definition

$$\mathcal{P}X = 2^X$$
$$\mathcal{P}f = \mathrm{im}_f$$

The functoriality as usual refers to an equation: 

Exercise 8.7 

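For $X \xrightarrow{f} Y \xrightarrow{g} Z$ one has

$$\mathcal{P}(gf) = (\mathcal{P}g)(\mathcal{P}f)$$

as maps $\mathcal{P}X = 2^X \to 2^Z = \mathcal{P}Z$. In other words

$$(gf)[A] = g[f[A]]$$

for any part $A$ of $X$.

Hint: Use the diagonal fill-in exercise (and the fact that the composites of surjectives
are surjective) to conclude that the composite on top in the following picture is the
other half of the image factorization of $(gf)i_A$.

[Diagram: across the top, surjections $|A| \to |f[A]| \to |g[f[A]]|$; vertical inclusions into $X$, $Y$, $Z$; across the bottom, $X \xrightarrow{f} Y \xrightarrow{g} Z$.]

Alternatively, (introducing properties, elements, and existential quantifiers) show
that for any $z$ and for any property $\ldots$,

$$\exists x[(gf)x = z \text{ and } \ldots] \iff \exists y[gy = z \text{ and } \exists x[fx = y \text{ and } \ldots]]$$

◊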
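
For finite sets the functoriality equation $(gf)[A] = g[f[A]]$ can be tested directly. This is a minimal sketch, assuming maps are represented as Python dicts and parts as frozensets; the helper names and the sample maps `f` and `g` are hypothetical.

```python
from itertools import chain, combinations

def parts(xs):
    """All parts (subsets) of a finite set, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(xs, r) for r in range(len(xs) + 1))]

def direct_image(f, A):
    """f[A]: the set of values f(x) for x in A."""
    return frozenset(f[x] for x in A)

def compose(g, f):
    """The composite gf, first f and then g."""
    return {x: g[f[x]] for x in f}

def functorial(f, g):
    """Check (gf)[A] == g[f[A]] for every part A of the domain of f."""
    gf = compose(g, f)
    return all(direct_image(gf, A) == direct_image(g, direct_image(f, A))
               for A in parts(f.keys()))

# Hypothetical example data: X = {0, 1, 2}, Y = {"a", "b", "c"}, Z = {True, False}.
f = {0: "a", 1: "a", 2: "b"}
g = {"a": True, "b": False, "c": True}
```

The check passes for any composable pair of dicts, which mirrors the second, element-based argument suggested in the hint.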


The claim that $\mathcal{P}f$,

$$A \mapsto f[A]$$

is one generalization of the basic evaluation

$$x \mapsto f(x)$$

on elements becomes more clear if we first internalize the idea that parts are
generalized elements in a definite manner
known as the singleton map, which assigns to every element $x$ the name of the
characteristic function of $x$ considered as a part of $X$. This map can be constructed
in basic steps as follows: The diagonal map

$$X \to X \times X$$

is a part of $X \times X$ (since it is split by the projections) and hence has a characteristic
map

$$X \times X \xrightarrow{\ =_X\ } 2$$

where (writing $a \varphi b$ for the typical value $\varphi(a, b)$ of a map $A \times B \xrightarrow{\varphi} C$)

$$x_1 =_X x_2 = \text{true} \iff x_1 = x_2$$

But like any map whose domain is a product, $=_X$ has an exponential transpose

$$X \to 2^X$$


Exercise 8.8 

If $A \times B \xrightarrow{f} C$ has the two exponential transposes $A \xrightarrow{f_1} C^B$ and $B \xrightarrow{f_2} C^A$, and
if $A = B$, then

$$f_1 = f_2 \iff f(a, b) = f(b, a) \text{ for all } (a, b) \in A \times A$$

◊

The latter condition is satisfied for $=_X$. Hence, the diagonal map gives rise by
the procedure above to (not two, but) one map

$$X \xrightarrow{\ \{\ \}\ } 2^X = \mathcal{P}X$$

such that

$$x' \in \{x\} \iff x' = x$$

Now the statement that $\mathcal{P}f$ generalizes evaluation at elements becomes the
following naturality (see “natural transformation” in Appendix C.1):

Exercise 8.9

For any $X \xrightarrow{f} Y$, the square below commutes

[Diagram: a square with top $f : X \to Y$, vertical singleton maps $\{\ \} : X \to \mathcal{P}X$ and $\{\ \} : Y \to \mathcal{P}Y$, and bottom $\mathcal{P}f : \mathcal{P}X \to \mathcal{P}Y$.]

◊

In case $f$ is itself a monomorphism (hence makes $X$ the underlying set of a part
of $Y$), $\mathcal{P}f$ can be calculated without any real use of existential quantification since
$f i_A$ is already injective (hence does not need to be further factored):

$$|f[A]| = |A|, \qquad i_{f[A]} = f i_A$$

Proposition 8.10: If $f$ is a monomorphism, then $2^f$ is a split epimorphism; in fact
$\mathcal{P}f$ is a section for $2^f$.


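Proof: For any $f$ one has

$$f[A] \subseteq B \text{ if and only if } A \subseteq Bf = 2^f(B)$$

so in particular

$$f[A] \subseteq f[A] \text{ if and only if } A \subseteq 2^f(f[A]) = (2^f \circ \mathcal{P}f)(A)$$

i.e.,

$$A \subseteq (2^f\,\mathcal{P}f)(A)$$

for all parts $A$ of $X$. But if $f$ is monic, then $(\mathcal{P}f)A = f i_A$, whose inverse image
under $2^f$ is reduced to $A$. In other words, the square

[Diagram: a square with top the identity $A \to A$, left $i_A : A \to X$, right $(\mathcal{P}f)A : A \to Y$, and bottom $f : X \to Y$.]

is almost trivially seen itself to satisfy the universal mapping property of an inverse
image square, that is,

$$A = 2^f((\mathcal{P}f)(A)) \text{ for all } A \text{ in } \mathcal{P}X = 2^X$$

We conclude that $2^Y \xrightarrow{2^f} 2^X$ is surjective, split by $\mathcal{P}f$. ■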
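
The content of Proposition 8.10 can be seen concretely on finite sets: for an injective map, taking the inverse image of a direct image returns the original part, while for a non-injective map only the inclusion survives. The sketch below assumes maps are Python dicts and parts are frozensets; `is_section` and the two sample maps are illustrative names, not from the text.

```python
from itertools import chain, combinations

def parts(xs):
    """All parts (subsets) of a finite set, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(xs, r) for r in range(len(xs) + 1))]

def direct_image(f, A):
    """f[A]: the set of values f(x) for x in A."""
    return frozenset(f[x] for x in A)

def inverse_image(f, B):
    """Bf: all x in the domain of f with f(x) in B."""
    return frozenset(x for x in f if f[x] in B)

def is_section(f):
    """True when inverse image after direct image is the identity on parts of the domain."""
    return all(inverse_image(f, direct_image(f, A)) == A for A in parts(f.keys()))

def unit_inclusion(f):
    """The inclusion A <= 2^f(f[A]), which holds for every map f."""
    return all(A <= inverse_image(f, direct_image(f, A)) for A in parts(f.keys()))

# Hypothetical example data.
f_monic = {0: "a", 1: "b"}        # injective into Y = {"a", "b", "c"}
f_not_monic = {0: "a", 1: "a"}    # identifies two elements
```

For `f_not_monic`, the part `{0}` has direct image `{"a"}`, whose inverse image is `{0, 1}`, so the section property fails exactly because the map is not injective.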


The contravariant power set functor $2^{(\ )}$ and the covariant power set functor $\mathcal{P}$ both
come up in geometry and analysis in many ways, for example in connection with
the condition that a map be continuous or that it be locally bounded. A continuous
space (topological space) $X$ is often considered to consist of an underlying abstract
set $|X|$ of points, together with a set $\mathcal{F}_X$ of “closed parts” and a structural map
$|X| \times \mathcal{F}_X \xrightarrow{\ \in_X\ } 2$ indicating when any given point is a member of any given closed
part. The transpose of $\in_X$, $\mathcal{F}_X \to 2^{|X|} = \mathcal{P}|X|$, interprets each closed part of $X$
as a part of the set $|X|$. Usually, special conditions are assumed on $\mathcal{F}_X$, $\in_X$, but
we will not need these here (however, see Exercise 8.11 for a class of examples).
On the other hand, a local boundedness space (= bornological set) $X$ involves an
underlying set $|X|$ of points, a set $\mathcal{B}_X$ of “bounded parts,” and a membership map
$|X| \times \mathcal{B}_X \to 2$; this is apparently exactly the same sort of thing $\mathcal{B}_X \to 2^{|X|}$ as a
continuous space, except for the different words. The special axiomatic conditions
usually imposed on a system of bounded parts are, of course, quite different from
those usually imposed on a system of closed parts (for example, an arbitrary subpart
of a bounded part should again be bounded, whereas no such thing is true for closed
sets; the whole part $1_X$ is always considered closed, but $1_X$ is bounded only for trivial
boundedness systems). However, the striking contrast between closed and bounded
is seen in the concepts of continuous versus bornological mapping. Namely, if
$X$ and $Y$ are continuous spaces, a continuous map $X \xrightarrow{f} Y$ is a map $|X| \xrightarrow{|f|} |Y|$
of points together with a proof that the inverse image of closed parts is closed:

[Diagram: a commutative square with dotted top $\mathcal{F}_Y \to \mathcal{F}_X$, vertical maps $\mathcal{F}_Y \to 2^{|Y|}$ and $\mathcal{F}_X \to 2^{|X|}$, and bottom $2^{|f|} : 2^{|Y|} \to 2^{|X|}$.]

By contrast, if $X$ and $Y$ are local boundedness spaces, a bornological map $X \xrightarrow{f} Y$
is a map $|X| \xrightarrow{|f|} |Y|$ of points together with a proof that the direct images of bounded
parts are bounded:

[Diagram: a commutative square with dotted top $\mathcal{B}_X \to \mathcal{B}_Y$, vertical maps $\mathcal{B}_X \to \mathcal{P}|X|$ and $\mathcal{B}_Y \to \mathcal{P}|Y|$, and bottom $\mathcal{P}|f| : \mathcal{P}|X| \to \mathcal{P}|Y|$.]


Exercise 8.11 

A metric space $X$ involves a set $|X|$ of points and a distance function

$$|X| \times |X| \xrightarrow{\ d_X\ } \mathbb{R}_{\geq 0}^{\infty}$$

where $\mathbb{R}_{\geq 0}^{\infty} = \{r \mid 0 \leq r \leq \infty\}$ is the set of (nonnegative, extended) real numbers
and $d_X$ is required to satisfy

$$d_X(x, x) = 0$$
$$d_X(x, y) + d_X(y, z) \geq d_X(x, z)$$

for any $(x, y, z) \in |X|^3$. This inequality can be pictured as

[Picture: a triangle with vertices $x$, $y$, $z$.]

A map $X \xrightarrow{f} Y$ of metric spaces is (often required to be) distance-decreasing in
the weak sense that

$$d_Y(fx_1, fx_2) \leq d_X(x_1, x_2)$$

for any $(x_1, x_2) \in |X|^2$. Each metric space has a natural notion of bounded part:
A part $B$ of $|X|$ is bounded if and only if it is contained in some ball with some
center and some finite radius, i.e.,

$$B \in \mathcal{B}_X \overset{\mathrm{def}}{\iff} \exists x_0 \exists r[r < \infty \text{ and } \forall x[x \in B \Rightarrow d_X(x, x_0) < r]]$$

But each metric space also has a natural notion of closed part, namely, any part
containing each point that can be approximated by points known to be in the part:

$$A \in \mathcal{F}_X \iff \forall x[\forall r[r > 0 \Rightarrow \exists a[a \in A \text{ and } d_X(x, a) < r]] \Rightarrow x \in A]$$

Show that any map $f$ of metric spaces (which is distance-decreasing in the weak
sense) is both continuous (in the sense that $2^f$ carries the natural closed parts of
the codomain back to natural closed parts of the domain) as well as bornological
(in the sense that $\mathcal{P}f$ carries the natural bounded parts of the domain to the natural
bounded parts of the codomain). ◊


8.3 The Natural Map $\mathcal{P}X \to 2^{2^X}$

Both the functor $\mathcal{P}$ and the (larger) twice-contravariant $2^{2^X}$ are covariant; we will
describe a direct natural comparison

$$\mathcal{P}X \xrightarrow{\ \int_X\ } 2^{2^X}$$

between them (which will turn out to be injective). Given any part $A$ of $X$,

$$\int_X (\ )\,A$$

will be a map

$$2^X \to 2$$

namely, the one whose value at any $X \xrightarrow{\varphi} 2$ is the truth value (in $2 = \{0, 1\}$)

$$\int_{x \in X} \varphi(x)\,A(dx)$$

calculated as follows: if there is any $x$ in $A$ at which $\varphi$ takes the value 1, then the
answer is 1, whereas if $\varphi$ restricted to $A$ is constantly 0 ($\varphi$ might take value 1 outside
of $A$; that does not matter), then the answer is 0. Of course, the $\varphi$’s are “really”
(characteristic functions of) parts also, so the description of $\int_X$ could equally well
be stated: for any given part $A$ in $\mathcal{P}X$, consider any part $\varphi$ in $2^X$ and ask if $\varphi$
intersects $A$. If so the answer is 1, otherwise 0. For example,

[Picture: three parts $\varphi_1$, $\varphi_2$, $\varphi_3$ of $X$ drawn against a part $A$, with the corresponding values of $\int_X \varphi_1(x)A(dx)$, $\int_X \varphi_2(x)A(dx)$, and $\int_X \varphi_3(x)A(dx)$.]

Thus,

$$\int_X (\ )\,A : 2^X \to 2$$

determines an element of $2^{2^X}$, but this can be done for any part $A$ in $\mathcal{P}X$, and hence
there should be a map $\mathcal{P}X \xrightarrow{\int_X} 2^{2^X}$. The reason the domain is not being called $2^X$
will be seen in Exercise 8.13 below.
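
On a finite set the “intersection test” described above is easy to write down, and one can also confirm that distinct parts give distinct functionals. This is only a sketch under stated assumptions: a part $\varphi$ is modeled by the frozenset of points where it takes the value 1, and the names `integral`, `functional_of`, and the three-element set `X` are illustrative.

```python
from itertools import chain, combinations

def parts(xs):
    """All parts (subsets) of a finite set, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(xs, r) for r in range(len(xs) + 1))]

def integral(phi, A):
    """Truth value in {0, 1}: 1 exactly when the part phi intersects the part A."""
    return 1 if phi & A else 0

def functional_of(A, X):
    """The map 2^X -> 2 determined by A, tabulated over all parts phi of X."""
    return tuple(integral(phi, A) for phi in parts(X))

# Hypothetical example data.
X = [0, 1, 2]
tables = {A: functional_of(A, X) for A in parts(X)}
```

Because the values of `tables` are pairwise different, the assignment sending `A` to its functional is injective on this example, in line with the parenthetical claim in the text.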


Exercise 8.12 

The existence of the map $\int_X$ can be deduced from previous concrete constructions
and its nature clarified for later use. Note that $2 = \mathcal{P}1$ and that there is a canonical
“union” map

$$\mathcal{P}\mathcal{P}1 \xrightarrow{\ \mu_1\ } \mathcal{P}1, \qquad \text{that is,} \quad \mathcal{P}2 \xrightarrow{\ \mu_1\ } 2$$

that acts as follows: For each of the four parts of 2,

$$\mu_1(\{0, 1\}) = \mu_1(\{1\}) = 1$$
$$\mu_1(\{0\}) = \mu_1(\emptyset) = 0$$

Moreover, the covariant functoriality of $\mathcal{P}$ (discussed in the previous sections) may
be considered (further internalized) as a canonical map

$$Y^X \times \mathcal{P}X \to \mathcal{P}Y$$
$$\langle f, A \rangle \mapsto f[A]$$

Hence, in particular, taking $Y = 2 = \mathcal{P}1$, we have

$$2^X \times \mathcal{P}X \to \mathcal{P}\mathcal{P}1 \xrightarrow{\ \mu_1\ } \mathcal{P}1 = 2$$

and therefore by taking the exponential transpose of the composite we have

$$\mathcal{P}X \to 2^{2^X}$$

which is the same as our $\int_X$. ◊


Exercise 8.13 

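For any $X \xrightarrow{f} Y$, the square

[Diagram: a square with top $\int_X : \mathcal{P}X \to 2^{2^X}$, left $\mathcal{P}f : \mathcal{P}X \to \mathcal{P}Y$, right $2^{2^f} : 2^{2^X} \to 2^{2^Y}$, and bottom $\int_Y : \mathcal{P}Y \to 2^{2^Y}$.]

is commutative. ◊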


Exercise 8.14 

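Let $X \xrightarrow{\ \delta\ } 2^{2^X}$ be the evaluation map $\delta(x)(\varphi) = \varphi(x)$. Then the triangle

[Diagram: a triangle with $\{\ \} : X \to \mathcal{P}X$, $\int_X : \mathcal{P}X \to 2^{2^X}$, and $\delta : X \to 2^{2^X}$.]

is commutative. ◊

Now only some of the elements of $2^{2^X}$ come from elements of $\mathcal{P}X$ by our $\int_X$
process: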


Exercise 8.15 

If $2^X \xrightarrow{\alpha} 2$ is such that there is an $A$ for which

$$\alpha(\varphi) = \int_{x \in X} \varphi(x)\,A(dx)$$

for all $X \xrightarrow{\varphi} 2$, then $\alpha$ has the following properties:

$$\alpha(0_X) = 0$$

where $0_X : X \to 1 \xrightarrow{0} 2$ is the constantly-0 function, and

$$\alpha(\varphi_1 \cup \varphi_2) = \alpha\varphi_1 \vee \alpha\varphi_2$$

for any pair $\varphi_1, \varphi_2 : X \to 2$, where

$$(\varphi_1 \cup \varphi_2)(x) \overset{\mathrm{def}}{=} \varphi_1(x) \vee \varphi_2(x)$$

for all $x$, where $\vee$ is defined by $v_1 \vee v_2 = 1$ if and only if $v_1 = 1$ or $v_2 = 1$ for $v_1$, $v_2$
in 2. Give examples of maps $2^X \to 2$ that do not have the two linearity properties
here shown to hold for $\int_X (\ )A$. ◊
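
The two linearity properties can be tested mechanically on a small finite set. In this sketch (an illustration, not the text's construction) a map $\varphi : X \to 2$ is modeled by the frozenset of points where it equals 1, so that $0_X$ is the empty set and $\cup$ is ordinary union; `is_linear`, `from_part`, and `at_least_two` are hypothetical names, and `at_least_two` is just one possible map that fails the second property.

```python
from itertools import chain, combinations

def parts(xs):
    """All parts (subsets) of a finite set, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(xs, r) for r in range(len(xs) + 1))]

def is_linear(alpha, X):
    """Check alpha(0_X) = 0 and alpha(phi1 u phi2) = alpha(phi1) or alpha(phi2)."""
    if alpha(frozenset()) != 0:
        return False
    return all(alpha(p | q) == max(alpha(p), alpha(q))
               for p in parts(X) for q in parts(X))

def from_part(A):
    """The functional phi |-> 1 if phi meets A, else 0."""
    return lambda phi: 1 if phi & A else 0

def at_least_two(phi):
    """A map 2^X -> 2 that is not of the above form: 1 iff phi has two or more points."""
    return 1 if len(phi) >= 2 else 0

# Hypothetical example data.
X = [0, 1, 2]
```

Here `max` on values in `{0, 1}` plays the role of $\vee$. The map `at_least_two` sends each singleton to 0 but the union of two singletons to 1, so it fails the second property.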

Comment: Such linear $\alpha$’s are sometimes called “grills”; they may be considered
as generalized parts in that every part $A$ determines a grill, and the union of two
grills can be defined in a way extending the union operation on parts. But there are
many grills that are not parts. For example, in the plane $X$, we can define an $\alpha$ by
$\alpha(\varphi) = 1$ if and only if $\varphi$ is a part of the plane with positive area.

Remark: All the general constructions in Sections 8.1-8.3 (such as singleton,
the map from the covariant power set to the doubly contravariant one, etc.) are
applicable in any topos with 2 replaced by $\Omega$.


8.4 Measuring, Averaging, and Winning with V-Valued Quantities

Several natural restrictions can be placed on elements of $V^{V^X}$ to yield a covariant
subfunctor

$$\mathrm{Hom}_?(V^X, V) \hookrightarrow V^{V^X}$$

of mathematical interest. When $V$ is equipped with some structure, as for example
with $V = 2$ above, conditions similar to lineality may be imposed. But let us briefly
consider two conditions that might reasonably be defined using only the knowledge
that $V$ is a given set. We want that at least the evaluation functionals $V^X \xrightarrow{\hat{x}} V$
(given by $\hat{x}(\varphi) = \varphi(x)$ for all $\varphi$) be included, and this will be true of
both our conditions.

Suppose that every month the same set $X$ of people are to be asked which of a
set $V$ of three brands of a product they prefer, giving a result $X \xrightarrow{\varphi} V$ each month.
We want to devise a single procedure $\alpha$ that will give $\alpha(\varphi)$ an element of $V$ so that
we can reasonably say that the people prefer $v = \alpha(\varphi)$ this month. One obvious
condition that $\alpha$ will need to satisfy is that it should be an averaging functional to
the extent that if it should happen that all people choose at $\varphi$ the same brand $v$, that
same $v$ should surely be the result $\alpha(\varphi)$.

Definition 8.16: $V^X \xrightarrow{\alpha} V$ is weakly averaging if for each $v$, if $\varphi_v(x) = v$ for all
$x$, then

$$\alpha(\varphi_v) = v$$

This is obviously an extremely weak condition and may be satisfied by functionals
that nobody would want to consider as averaging. (When $V$ has additional structure,
it makes sense to require that $\alpha$ also be linear, in which case there are often
theorems saying that a linear averaging procedure is an expectation with respect to a
probability distribution on $X$.)

Proposition 8.17: For any $x$ in $X$, $V^X \xrightarrow{\hat{x}} V$ is weakly averaging. That is, if $\mathrm{Hom}_V$
denotes the constant-preserving functionals, one has

$$X \to \mathrm{Hom}_V(V^X, V) \hookrightarrow V^{V^X}$$

Proof: Obvious. ■

Thus, a trivial way of defining a weakly averaging $V^X \xrightarrow{\alpha} V$ is to choose some
single individual $x$ in $X$ and then evaluate every survey $\varphi$ by merely noting how $x$
answered and presenting his or her answer as the “digested result of the survey”.
This offends our sense of fairness to the extent that we are led to formulate a
different sort of condition (which we will not study here) that could be imposed on
an averaging functional:

Definition 8.18: The arrow $V^X \xrightarrow{\alpha} V$ is said to be symmetric if for every
invertible self-map (also called automorphism or permutation) $X \xrightarrow{\sigma} X$ of $X$,

$$\alpha(\varphi\sigma) = \alpha(\varphi)$$

for all $X \xrightarrow{\varphi} V$.
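
The difference between the two conditions can be seen on a toy survey. The sketch below assumes three people and three brands; the functionals `dictator` (evaluation at person 0) and `plurality` (most frequent answer, ties broken by taking the alphabetically first brand) are hypothetical examples chosen for illustration and do not appear in the text.

```python
from itertools import product, permutations
from collections import Counter

X = (0, 1, 2)              # hypothetical set of people
V = ("a", "b", "c")        # hypothetical set of brands
surveys = [dict(zip(X, vals)) for vals in product(V, repeat=len(X))]

def dictator(phi):
    """Evaluation at the single individual 0."""
    return phi[0]

def plurality(phi):
    """Most frequent answer; ties go to the alphabetically first brand."""
    counts = Counter(phi.values())
    best = max(counts.values())
    return min(v for v in counts if counts[v] == best)

def weakly_averaging(alpha):
    """alpha returns v on the constant survey with value v, for every v."""
    return all(alpha({x: v for x in X}) == v for v in V)

def symmetric(alpha):
    """alpha(phi sigma) == alpha(phi) for every permutation sigma of X."""
    return all(alpha({x: phi[s[x]] for x in X}) == alpha(phi)
               for phi in surveys for s in permutations(X))
```

Both functionals are weakly averaging, but only `plurality` is symmetric: swapping person 0 with a person who answered differently changes the value of `dictator`.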

We will emphasize instead a certain kind of highly skewed weakly averaging
functionals. The idea is that one might hope to recover $X$ from $\Phi = V^X$ by putting
strong enough conditions on maps $\Phi \xrightarrow{\alpha} V$ to characterize the evaluation
functionals $\hat{x}$ in case $\Phi = V^X$. Such considerations are basic to almost all situations
in algebraic geometry, particle physics, functional analysis, and so on, wherein one wants to
locate or recapture points of space or states of motion $X$ by comparing (in
quantitative ways) observable variable quantities in $V^X$ with (relatively) constant quantities
in $V$.

Definition 8.19: A $V$-generalized point of $X$ is a functional $V^X \xrightarrow{\alpha} V$ such that
for each $V \xrightarrow{\lambda} V$,

$$\alpha(\lambda\varphi) = \lambda(\alpha(\varphi))$$

for all $X \xrightarrow{\varphi} V$, where $\lambda\varphi$ denotes the composition and $\lambda(v)$ denotes usual
evaluation. That is

[Diagram: a square with top $1_{V^V} \times \alpha : V^V \times V^X \to V^V \times V$, left composition $V^V \times V^X \to V^X$, right $\mathrm{eval} : V^V \times V \to V$, and bottom $\alpha : V^X \to V$.]

should commute.

This condition is quite strong. For example, if $V$ denotes the real numbers and
$\alpha$ is an expectation, and if we consider as an example of $\lambda$ the operation of
multiplying by a constant (e.g., doubling), then

$$\alpha(\lambda\varphi) = \lambda(\alpha(\varphi))$$

follows from lineality. But instead take $\lambda$ to be squaring: just the case

$$\alpha(\varphi^2) = \alpha(\varphi)^2$$

of our condition forces (the distribution underlying) $\alpha$ to be so concentrated that
the standard deviation (relative to $\alpha$) of all random variables $\varphi$ is zero!

Proposition 8.20: Each point is a generalized point.

Proof: If $\alpha = \hat{x}$, then for any $X \xrightarrow{\varphi} V \xrightarrow{\lambda} V$

$$\alpha(\lambda\varphi) = (\lambda\varphi)x = \lambda(\varphi x) = \lambda(\alpha(\varphi))$$

■

Exercise 8.21

Any generalized point is weakly averaging.

Hint: Each element of $V$ determines a constant self-map of $V$, $V \hookrightarrow V^V$, and
this can be used to show that

$$X \hookrightarrow \mathrm{Hom}_{V^V}(V^X, V) \hookrightarrow \mathrm{Hom}_V(V^X, V) \hookrightarrow V^{V^X}$$

where $\mathrm{Hom}_{V^V}$ denotes the generalized points, that is, the functionals $\alpha$ that are
homogeneous with respect to all $\lambda$ in $V^V$. ◊

To what extent all generalized points are really just the original elements has
long been of interest. Isbell showed in 1960 [Is60] that if $V$ is a given infinite set,
then

$$X \xrightarrow{\ \sim\ } \mathrm{Hom}_{V^V}(V^X, V)$$

for all sets $X$ that arise in ordinary geometry and analysis even though by Cantor’s
theorem most of these $X$’s are larger than $V$. However, many set-theorists study
the possibility of the existence of sets $X$ so large that they cannot be “measured”
by any fixed $V$ in this way (paradoxically, these hypothetical extremely large sets
are usually called “measurable cardinals”).

For a small $V$, like $V = 2$, it is easy to find examples of generalized points that
are not points but are interesting in view of some applications: $V^V$ has four elements
$\lambda$, of which the identity $1_V$ contributes no restriction on generalized points. Thus,

Proposition 8.22: A functional $2^X \xrightarrow{\alpha} 2$ commutes with all $\lambda$ in $2^2$ if and only if

$$\alpha(\ulcorner X \urcorner) = 1, \qquad \alpha(\emptyset) = 0$$

$$\alpha(\varphi) = 1 \iff \alpha(\neg\varphi) = 0$$

(where $\neg : 2 \to 2$ is defined by $\neg 0 = 1$, $\neg 1 = 0$). ■

Such generalized points $\alpha$ are sometimes considered to arise from a context in
which the elements of $X$ can “choose sides” in various ways to produce an alignment
$\varphi$ that will either “win” $\alpha(\varphi) = 1$ or “lose” $\alpha(\varphi) = 0$. (The following is a further
restriction on $\alpha$ sometimes considered: if $\varphi \subseteq \varphi'$ and $\alpha(\varphi) = 1$ then $\alpha(\varphi') = 1$.
However, it is not true in all situations $\alpha$ that a team $\varphi'$ bigger than a winning team
$\varphi$ would also win (consider what may happen if too many CIA and MI5 individuals
infiltrate our team).) We want to see that there are situations $\alpha$ that are generalized
points of $X$ in the sense that they preserve the action of all $\lambda \in 2^2$ but are not points
$x_0$; in other words, the situation $\alpha$ is more complex than one in which there is such
a towering star player $x_0$ that for any $\varphi$, $\varphi$ is a winning team if and only if $x_0$ is on
$\varphi$. If $X$ is too small, such complexity will not be possible.

Exercise 8.23

If $X = V$, then $X \xrightarrow{\sim} \mathrm{Hom}_{V^V}(V^X, V)$. Take the example $\varphi = 1_X$ but note that in
this case a condition $\lambda$ in $V^V$ may be considered also as an arbitrary input for a
functional. ◊

Thus, for $V = 2$, we need to take $X$ with at least three elements to find generalized
points $\alpha$ which do not come from points.

Exercise 8.24 

If $X$ has $n$ elements, show that the number of maps $2^X \xrightarrow{\alpha} 2$ that commute with
the action of all four $\lambda$ in $2^2$ is

$$2^{(2^{n-1} - 1)}$$

For the case $n = 3$, $X = \{a, b, c\}$, determine explicitly all eight $\alpha$’s by displaying,
for each $\alpha$, all the teams $\varphi$ that win $\alpha$.

(Such $\varphi$ can conveniently be displayed as subsets of $\{a, b, c\}$.) ◊
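
For very small $n$ the maps in question can simply be enumerated. The following sketch assumes $X = \{0, \ldots, n-1\}$, represents a map $\varphi : X \to 2$ as a tuple of bits and each of the four self-maps of 2 as a dict; the function name `count_generalized_points` is hypothetical. It is a brute-force count and is only practical for $n \leq 4$.

```python
from itertools import product

def count_generalized_points(n):
    """Count maps alpha : 2^X -> 2 with alpha(lam . phi) = lam(alpha(phi)) for all four lam."""
    phis = list(product((0, 1), repeat=n))                   # all maps X -> 2
    lambdas = [dict(zip((0, 1), vals))
               for vals in product((0, 1), repeat=2)]        # all four maps 2 -> 2
    count = 0
    for values in product((0, 1), repeat=len(phis)):         # every candidate alpha
        alpha = dict(zip(phis, values))
        if all(alpha[tuple(lam[v] for v in phi)] == lam[alpha[phi]]
               for lam in lambdas for phi in phis):
            count += 1
    return count
```

For `n = 3` the enumeration finds eight maps, of which only three are evaluations at a point; for `n = 1, 2, 3` the counts agree with the closed form `2 ** (2 ** (n - 1) - 1)`, which arises because the complementary pairs of parts other than the pair (empty part, whole set) can be decided independently.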

Exercise 8.25 

Let $V = 3$. Show that the 27 conditions a generalized point must satisfy all follow
from a much smaller number among them. Show that any generalized point of any
finite set $X$ is in fact a point of $X$ itself. ◊

Remark 8.26: The recovery of $X$ from measurement, i.e.,

$$X \xrightarrow{\ \sim\ } \mathrm{Hom}_{V^V}(V^X, V)$$

is also valid when $X$ is “any” metrizable continuous space, $V = \mathbb{R}$ (the space of
real numbers), and all maps are continuous. ◊


8.5 Additional Exercises 

Exercise 8.27 

To define the covariant power set functor we used only the elementary (topos)
properties of $\mathcal{S}$. Describe $\mathrm{im}_f$ for an arrow $X \xrightarrow{f} Y$ in $\mathcal{S}/X$. Note that to do this it
is first necessary to describe the objects $\mathcal{P}X$ (recall Exercises 5.12, 6.8).

Exercise 8.28

Describe $\mathrm{im}_f$ for an arrow $X \xrightarrow{f} Y$ in $\mathcal{S}^{2^{op}}$.

Recall Section 6.2 and Exercise 6.13.

Exercise 8.29

Describe $\mathrm{im}_f$ for an arrow $X \xrightarrow{f} Y$ in $M$-sets.

Recall Exercises 5.13, 6.15.

Exercise 8.30

Similarly, to define the singleton arrow, $X \xrightarrow{\{\ \}} \mathcal{P}X$, we again used only the
elementary (topos) properties of $\mathcal{S}$. Describe $X \xrightarrow{\{\ \}} \mathcal{P}X$ for an object $X$ in each of
$\mathcal{S}/A$, $\mathcal{S}^{2^{op}}$, and $M$-sets. Keep in mind that 2 in the construction of Section 8.2 must
be replaced by the appropriate truth-value object.

Exercise 8.31

Show that the diagonal arrow $X \to X \times X$ provides components of a natural
transformation (see Appendix C.1) from the identity functor to the composite functor
$(- \times -)\Delta$ (see Exercise 5.17). Find a natural transformation from the composite
functor $\Delta(- \times -)$ to the identity functor on $\mathcal{S} \times \mathcal{S}$.

9.1 The Axiom of Infinity: Number Theory

Recall that we denote by $\mathcal{S}$ the category of (abstract, discrete, constant) sets and
arbitrary maps between them that we have studied till now. In the various branches
of mathematics (such as mechanics, geometry, analysis, number theory, logic)
there arise many different categories $\mathcal{X}$ of (not necessarily discrete, variable) sets
and respectful maps between them. The relation between $\mathcal{S}$ and the $\mathcal{X}$’s is (at least)
threefold:

(0) $\mathcal{S}$ is “case zero” of an $\mathcal{X}$ in that in general the sets in $\mathcal{X}$ have some sort of
structure such as glue, motion and so on, but in $\mathcal{S}$ this structure is reduced to
nothing. However, the general $\mathcal{X}$ often has a functor $\mathcal{X} \xrightarrow{|\ |} \mathcal{S}$ determining the
mere number (Cantor) $|X|$ of each such emergent aggregate $X$.

(1) A great many of the mathematical properties of such a category $\mathcal{X}$ of variable
sets are the same or similar to properties of the category $\mathcal{S}$ of constant sets.
Thus, a thorough knowledge of the properties of $\mathcal{S}$, together with some
categorical wisdom, can be indispensable in dealing with problems of analysis,
combinatorics, and so forth. The main common properties include the concepts
of function spaces $X^T$ and of power sets $\mathcal{P}(X)$.

(2) Many examples of categories of variable sets $\mathcal{X}$ are reconstructible from $\mathcal{S}$ as
$\mathcal{X} = \mathcal{S}^{T^{op}}$, where $T$ is a datum (which can also be described in $\mathcal{S}$) whose role is
to specify the general nature of the glue, motion, and so on, in which all sets in
$\mathcal{X}$ participate, and $\mathcal{X}$ is considered to consist of all possible examples having
that particular general nature. (Sometimes such a datum $T$ is called a theory;
we will explicitly consider a few such theories, some of which can actually be
extracted in a fairly simple way from $\mathcal{X}$ itself.)

The connection between $\mathcal{X}$ and $\mathcal{S}^{T^{op}}$ often comes about in the following way:
among the variable sets in $\mathcal{X}$ there are a few special ones $T \hookrightarrow \mathcal{X}$ such that
knowledge of the $|X^T|$ for $T$ in $T$ and their interactions as constant sets (in $\mathcal{S}$)
suffices to reconstruct any variable set $X$. ($T = 2$ was considered in Chapter 6.) A
basic example of a mode of variability of sets is $T = \circlearrowright$. That is, we consider a
category $\mathcal{S}^{\circlearrowright}$ of variable sets $X$ in which $X$ has not only elements but also an
internal “dynamic” of the sort that any given element uniquely determines a “next”
element, and the maps between sets respect this internal dynamic. The category
$\mathcal{S}^{\circlearrowright}$ can be regarded as constructed from $\mathcal{S}$ as follows:

A set $X$ of $\mathcal{S}^{\circlearrowright}$ “is” (i.e., determines and is determined by) a set of $\mathcal{S}$ together
with an endomap as structure

$$X \xrightarrow{\ \xi_X\ } X \quad \text{in } \mathcal{S}$$

A map $X \xrightarrow{f} Y$ of $\mathcal{S}^{\circlearrowright}$ “is” two sets with endomaps $\xi_X$, $\xi_Y$ and a mapping $f$ in $\mathcal{S}$
satisfying the equation in $\mathcal{S}$,

$$f\xi_X = \xi_Y f$$

so the following diagram commutes:

[Diagram: a commutative square with top $f : X \to Y$, vertical endomaps $\xi_X : X \to X$ and $\xi_Y : Y \to Y$, and bottom $f : X \to Y$.]

Composition in $\mathcal{S}^{\circlearrowright}$ of two maps $X \xrightarrow{f} Y \xrightarrow{g} Z$ is only defined in
case the codomain of $f$ and the domain of $g$ (not only have the same underlying
abstract set of elements but also) have the same internal dynamic $\xi_Y$. In that case the
composite is formed by forming the $gf$ as in $\mathcal{S}$ and forgetting $\xi_Y$ while retaining $\xi_X$
and $\xi_Z$. Note that we will sometimes abuse notation and not mention the dynamic
of a set.


Exercise 9.1 

If $f\xi_X = \xi_Y f$ and if $g\xi_Y = \xi_Z g$, then $(gf)\xi_X = \xi_Z(gf)$. That is, the composite
defined above really does give a result that is again a map in $\mathcal{S}^{\circlearrowright}$. ◊

Exercise 9.2

Let 1 denote a one-element set equipped with the only possible dynamic (endomap).
For any $X$ in $\mathcal{S}^{\circlearrowright}$ there is a unique $\mathcal{S}^{\circlearrowright}$-map $X \to 1$. Show also that $\mathcal{S}^{\circlearrowright}$ has
an initial object. ◊

Exercise 9.3

A map $1 \xrightarrow{x} X$ in $\mathcal{S}^{\circlearrowright}$ is always the same thing as a fixed point of the dynamic of
$X$, that is, an element $x$ of the underlying abstract set of $X$ for which

$$\xi_X x = x$$

◊

Thus, the maps from $T = 1$, although an extremely important aspect of the sets $X$
in $\mathcal{S}^{\circlearrowright}$, fall far short of detecting all the elements of the underlying set of $X$; another
object $T = N$ is necessary for that purpose, and that will force the existence in $\mathcal{S}$
itself of infinite sets. In more detail, the underlying abstract set of $N$ (also denoted $N$)
must be equipped with an $\mathcal{S}$-endomap $N \xrightarrow{\sigma} N$ (usually called “successor”) of the
underlying set (in order to be a set of $\mathcal{S}^{\circlearrowright}$ at all), and we want there to be a natural
invertible correspondence

$$\frac{N \xrightarrow{\ x\ } X \text{ in } \mathcal{S}^{\circlearrowright}}{1 \xrightarrow{\ x_0\ } X \text{ in } \mathcal{S}}$$

Taking the case $X = N$, $x = 1_N$, we deduce that there must be a distinguished
element

$$1 \xrightarrow{\ 0\ } N$$

of the underlying abstract set of $N$. Since just to say $N \xrightarrow{x} X$ in $\mathcal{S}^{\circlearrowright}$ is to say that
$x\sigma = \xi x$ in $\mathcal{S}$, we are led to the following statement characterizing the system

$$1 \xrightarrow{\ 0\ } N \xrightarrow{\ \sigma\ } N$$

of maps in $\mathcal{S}^{\circlearrowright}$ in terms of $\mathcal{S}$ only:

AXIOM: DEDEKIND-PEANO

There exist $1 \xrightarrow{0} N \xrightarrow{\sigma} N$ in $\mathcal{S}$ such that for any diagram

$$1 \xrightarrow{\ x_0\ } X \xrightarrow{\ \xi\ } X$$

in $\mathcal{S}$ there exists a unique sequence $x$ for which both $x0 = x_0$ and $x\sigma = \xi x$.

[Diagram: the triangle $x0 = x_0$ and the square $x\sigma = \xi x$, with $x : N \to X$.]

Here “sequence” is the standard name for maps in $\mathcal{S}$ with domain $N$. The equation
expressed by the commutativity of the square is the condition that $x$ be a respectful
map in variable sets, and the triangular equation expresses that the element $x_0$
is being represented by $x$. Notice also that the axiom is a statement about $\mathcal{S}$. It
is precisely the axiom of infinity we promised in Chapter 6. With this axiom we
have completed our listing of axioms for $\mathcal{S}$. In any category, an object $N$ (or more
precisely, the diagram $1 \xrightarrow{0} N \xrightarrow{\sigma} N$) satisfying the Dedekind-Peano axiom is
called a natural number object.
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Now the claim that N is not finite can be justified by examples of X. For example, 
one variable set in $\mathcal{S}^{\circlearrowright}$ is

$$X_5: \quad 0 \to 1 \to 2 \to 3 \to 4 \circlearrowleft$$

i.e. $\xi(x) = x + 1$ for $x < 4$; $\xi(4) = 4$.

Hence, there must exist a map $N \xrightarrow{x} X_5$ in $\mathcal{S}^{\circlearrowright}$ with this five-element codomain,
that is, a map in $\mathcal{S}$ for which

$$x0 = 0$$
$$x\sigma 0 = x0 + 1 = 1$$
$$x\sigma\sigma 0 = x\sigma 0 + 1 = 1 + 1 = 2$$
$$x\sigma\sigma\sigma 0 = x\sigma\sigma 0 + 1 = 2 + 1 = 3$$
$$x\sigma\sigma\sigma\sigma 0 = x\sigma\sigma\sigma 0 + 1 = 3 + 1 = 4$$
$$x\sigma\sigma\sigma\sigma\sigma 0 = \xi(4) = 4$$

Hence, $0, \sigma 0, \sigma\sigma 0, \sigma\sigma\sigma 0, \sigma\sigma\sigma\sigma 0$ must be distinct elements of $N$ since they have
distinct values under at least one map $x$. Of course under this $x$ all the elements
$\sigma^5 0, \sigma^6 0, \ldots$ get the same value in $X_5$. But the single set $N$ is supposed to work for
all $X$, so one could take $X = X_6$ to see that there is at least one $x$ that distinguishes
$\sigma^5 0$ from $\sigma^6 0$, and so on. Thus, in $N$ all elements obtained from 0 by successive
application of the successor map $\sigma$ are distinct. If $N$ had other elements than those so
obtained, an $X$ could be constructed for which two mappings $N \rightrightarrows X$ represent
the same element of $X$, contradicting the uniqueness part of the universal mapping
property of $N$. Thus, we are led to the following intuitive idea:

$$N = \{0, 1, 2, 3, \ldots\} \qquad \sigma n = n + 1$$
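
With the intuitive picture of $N$ in hand, the sequence guaranteed by the axiom can be computed by simply iterating the dynamic. The sketch below models this for the five-element example; it assumes Python integers stand in for elements of $N$, and the names `simple_recursion` and `xi5` are illustrative.

```python
def simple_recursion(x0, xi):
    """Return the sequence x with x(0) = x0 and x(n + 1) = xi(x(n))."""
    def x(n):
        value = x0
        for _ in range(n):
            value = xi(value)
        return value
    return x

def xi5(v):
    """The dynamic of X_5: advance by one until reaching the fixed point 4."""
    return v + 1 if v < 4 else 4

x = simple_recursion(0, xi5)
```

Evaluating `x` at 0 through 4 gives five distinct values, while every later argument is sent to the fixed point 4, matching the computation displayed above.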

9.2 Recursion 

The universal property of the successor $\sigma$ says (using the usual notation $x_n = xn$
for the values of a sequence) that for any given “next step” rule $X \xrightarrow{\xi} X$ on any set $X$
and any given starting value $1 \xrightarrow{x_0} X$, there is a unique sequence $x$ in $X$ such that

$$x_0 = \text{the given starting value}$$
$$x_{n+1} = \xi(x_n) \text{ for all } n$$

The sequence $x$ is said to be defined by simple recursion from $x_0$, $\xi$. We will prove
in Lemma 9.5 the existence of sequences defined by somewhat more general kinds
of recursions as well. First note that we can formally prove one of the characteristic
properties of infinite sets.

Theorem 9.4: $1 + N \cong N$. The successor map is injective but not surjective,
therefore $N$ is Dedekind infinite.

Proof: By the universal mapping property of coproduct there is a unique $f$ for
which

[Diagram: the coproduct injections $i_* : 1 \to 1 + N$ and $i_N : N \to 1 + N$, with $f : 1 + N \to N$ satisfying $f i_* = 0$ and $f i_N = \sigma$.]

We want to define an inverse $N \xrightarrow{g} 1 + N = X$ for $f$ by recursion. But the
“recursion” satisfied by $g$ is

$$g0 = i_*$$
$$g(n + 1) = i_N(n)$$

in which the right-hand side is not simply some function $\xi$ of the “previous value”
$g(n)$ but instead is some function of $n$ and does not mention $g(n)$. Even dependence
on both $n$ and $g(n)$ still permits recursive definition:

Lemma 9.5: (Recursion Lemma) If $N \times A \xrightarrow{h} A$ is any given map and if $1 \xrightarrow{a_0} A$
is any given element, then there exists a unique sequence $N \xrightarrow{g} A$ for which

$$g(0) = a_0$$
$$g(n + 1) = h(n, g(n))$$

Proof: (The discrete version of a standard method for dealing with nonautonomous
differential equations: augment the state space to include also the time coordinate.)
Define

$$X = N \times A, \qquad \xi(n, a) = \langle n + 1, h(n, a) \rangle, \qquad x_0 = \langle 0, a_0 \rangle$$

Then by simple recursion there is a unique $x$ for which

$$x(0) = x_0 = \langle 0, a_0 \rangle$$
$$x(n + 1) = \xi x(n)$$

But if we define $u$ and $g$ by

$$u = p_N x$$
$$g = p_A x$$

so that $xn = \langle un, gn \rangle$, then

$$\langle u0, g0 \rangle = \langle 0, a_0 \rangle$$
$$\langle u(n + 1), g(n + 1) \rangle = \xi\langle u(n), g(n) \rangle = \langle u(n) + 1, h(u(n), g(n)) \rangle$$

by definition of $\xi$. Hence, by taking projections

$$u(0) = 0$$
$$u(n + 1) = u(n) + 1$$

that is, $u = 1_N$ by uniqueness, and so

$$g(0) = a_0$$
$$g(n + 1) = h(n, gn)$$

as required; $g$ is uniquely determined since $x$ is. ■
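
The proof's device of carrying the time coordinate along in the state translates directly into code. This is a minimal sketch, assuming Python integers for $N$ and ordinary Python functions for maps; `simple_recursion`, `recursion`, and the two sample rules are illustrative names rather than anything defined in the text.

```python
def simple_recursion(x0, xi):
    """Return the sequence x with x(0) = x0 and x(n + 1) = xi(x(n))."""
    def x(n):
        value = x0
        for _ in range(n):
            value = xi(value)
        return value
    return x

def recursion(a0, h):
    """Return g with g(0) = a0 and g(n + 1) = h(n, g(n)), via the augmented state (n, a)."""
    def xi(state):
        n, a = state
        return (n + 1, h(n, a))
    x = simple_recursion((0, a0), xi)
    return lambda n: x(n)[1]      # second projection of the augmented sequence

# Hypothetical uses: a rule depending on both n and a, and a rule depending on n alone.
factorial = recursion(1, lambda n, a: (n + 1) * a)
previous = recursion(0, lambda n, a: n)
```

The first projection of the augmented sequence is always `n` itself, which is the step "$u = 1_N$ by uniqueness" in the proof; `previous` illustrates the extreme case in which the next value ignores the current value entirely.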


Corollary 9.6: There is a unique map $N \xrightarrow{p} N$ called the predecessor map for
which

$$p(0) = 0$$
$$p\sigma = 1_N$$

In other words $p(n + 1) = n$ for all $n$. (Hence $\sigma$ is injective since it has a retraction.)

Proof: Take the first projection from $N \times N$ for $h$ in the lemma. This is the opposite
extreme (from the original axiom) of the lemma, namely the rule $N \times A \xrightarrow{h} A$
(with $N = A$) for computing the next value depends only on $N$ rather than
on $A$. ■

Now we can complete the proof of the theorem $1 + N \cong N$ because the inverse
$g$ can be defined by

$$g(0) = i_*$$
$$g(n + 1) = i_N(n)$$

(similar to, but not identical with, the predecessor function). Then

$$fg(0) = 0$$
$$(fg)(n + 1) = f i_N(n) = \sigma(n) = n + 1$$

so that $fg = 1_N$ by uniqueness of functions defined by recursion. Also

$$gf i_* = g(0) = i_*$$
$$gf i_N = g\sigma = i_N$$

so that $gf = 1_{1+N}$ by the uniqueness of maps from coproduct sets. Hence,
$1 + N \cong N$. ■


9.3 Arithmetic of N 

We also want to show that $N \times N \cong N$, but we need first to develop some of the
arithmetic of $N$ to be able to define the comparison maps.

Much of the arithmetic of $N$ is implicit in the internalization of the recursion
process itself. First note that

Proposition 9.7: Given any map $A \xrightarrow{\alpha} A$ there is a sequence $N \xrightarrow{\alpha^{(\ )}} A^A$ such
that

$$\alpha^0 = 1_A$$
$$\alpha^{n+1} = \alpha\alpha^n \quad (\text{composition})$$

(note the abuse of notation which ignores $\ulcorner\ \urcorner$).

Proof: Apply the Dedekind-Peano axiom to the case where $X = A^A$, $x_0 = \ulcorner 1_A \urcorner$,
and where the “next stage” endomap

$$A^A \xrightarrow{\ \xi\ } A^A$$

is the one that composes any map $\beta$ with $\alpha$: $\xi(\beta) = \alpha\beta$ for all $1 \xrightarrow{\beta} A^A$. ■

Internalizing the proposition, we get

Theorem 9.8: For any set $A$ there is a map

$$A^A \xrightarrow{\ \mathrm{iter}_A\ } (A^A)^N$$

that assigns to any $\alpha$ the (name of the) sequence of iterates of $\alpha$.

Exercise 9.9

Show directly that $\mathrm{iter}_A$ exists. ◊

For example, if $A = N$ we have a procedure 

$$N^N \xrightarrow{\mathrm{iter}_N} (N^N)^N \cong N^{N \times N}$$

which assigns, to each map $N \xrightarrow{\alpha} N$, a binary operation $\bar{\alpha}$ on $N$ satisfying 

$$\bar{\alpha}(0, m) = m \quad \text{for all } m$$
$$\bar{\alpha}(n + 1, m) = \alpha(\bar{\alpha}(n, m)) \quad \text{for all } n, m$$

For example, if we take $\alpha = \sigma$, the basic successor operation itself, then the 
iteratively derived binary operation satisfies 

$$\bar{\sigma}(0, m) = m$$

$$\bar{\sigma}(n + 1, m) = \sigma(\bar{\sigma}(n, m)) = \bar{\sigma}(n, m) + 1$$

This proves the existence (as a map) of the operation of addition of natural 
numbers 

$$\bar{\sigma}(n, m) = n + m$$

in which notation the above recursion rules for generating $\bar{\sigma}$ become the familiar 

$$0 + m = m$$

$$(n + 1) + m = (n + m) + 1$$
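
The passage from a map to its iterated binary operation can be tried out numerically. The following Python sketch is only an illustration under stated assumptions: natural numbers are modeled by Python integers, and the names `iterate`, `successor`, and `add` are hypothetical, not notation from the text.

```python
def iterate(alpha):
    """Return the binary operation (n, m) -> alpha applied n times to m."""
    def alpha_bar(n, m):
        result = m                  # alpha_bar(0, m) = m
        for _ in range(n):          # alpha_bar(k + 1, m) = alpha(alpha_bar(k, m))
            result = alpha(result)
        return result
    return alpha_bar


def successor(n):
    """The basic successor operation on natural numbers."""
    return n + 1


# Addition arises by iterating the successor map (an illustrative name).
add = iterate(successor)
```

Under these assumptions `add(n, m)` applies the successor $n$ times to $m$, which is exactly the recursion $0 + m = m$, $(n + 1) + m = (n + m) + 1$.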

Diagonalizing the addition map 

$$N \to N \times N \xrightarrow{+} N$$

we get the map $N \to N$, which is usually denoted by $2 \cdot (\ )$, “multiplication by 
2”. The iteration process can be applied to the latter to yield the binary operation $\bar{\alpha}$ 
for which 

$$\bar{\alpha}(0, m) = m$$

$$\bar{\alpha}(n + 1, m) = 2 \cdot \bar{\alpha}(n, m)$$


Exercise 9.10 

Determine the operation $\bar{\alpha}$ satisfying the preceding recursion relations. ◊


Exercise 9.11 

Use the maps $2 \cdot (\ )$ and $2 \cdot (\ ) + 1$ to set up an explicit invertible map 
$N + N \xrightarrow{\ \cong\ } N$. ◊

To obtain the existence of multiplication as a map, note that it should satisfy the 
recursion 

$$0 \cdot m = 0$$

$$(n + 1) \cdot m = n \cdot m + m$$

More generally, given any set $A$ equipped with an “addition” (assumed associative) 
$A \times A \xrightarrow{+} A$ and a “zero” $1 \xrightarrow{0} A$, we can simply apply the iterator to the transpose 
of $+$ and follow the result by evaluation at 0 

$$A \to A^A \xrightarrow{\mathrm{iter}_A} (A^A)^N \to A^N$$

obtaining a map whose transpose is the map 

$$N \times A \xrightarrow{\ \cdot\ } A$$

which is usually called multiplication. 
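
The three steps (transpose of $+$, iteration, evaluation at zero) can be sketched in Python. This is a hedged illustration, not the book's construction itself: the function names `iterate` and `multiply` are hypothetical, and the "addition" and "zero" are passed in as ordinary Python values.

```python
def iterate(alpha, n):
    """Return the n-th iterate of the endomap alpha."""
    def alpha_n(a):
        for _ in range(n):
            a = alpha(a)
        return a
    return alpha_n


def multiply(n, a, add, zero):
    """Compute n . a by iterating the map (a + _) n times and evaluating at zero."""
    def translate_by_a(b):
        # Value at a of the transpose of +: an endomap of A.
        return add(a, b)
    return iterate(translate_by_a, n)(zero)
```

With ordinary addition on integers this reproduces the usual product, and with string concatenation (associative, with the empty string as zero) it gives $n$-fold repetition, which shows that only the assumed structure on $A$ is used.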

The idea for proving $N \times N \cong N$ is based on the observation that the fibers of 
the map $N \times N \xrightarrow{+} N$ are finite (in fact the $n$th fiber has length $n + 1$), and so it 
should be possible to define a map $N \to N \times N$ that runs through each fiber one 
after another, thus eventually running through all of $N \times N$ (without repetitions): 

[Figure: the enumeration path running through the fibers of $+$ in $N \times N$.] 

The inverse $N \times N \xrightarrow{f} N$ of the desired enumeration $N \to N \times N$ will indicate 
the number of steps that have been traversed in the enumerated path in reaching a 
given point $(x, y)$. The fiber of $+$ in which $(x, y)$ lies is determined by $x + y = n$, 
and at the top (beginning) of the $n$th fiber the path has taken 

$$\sum_{k=0}^{n-1} (k + 1) = \frac{n(n + 1)}{2} = \frac{(x + y)(x + y + 1)}{2}$$

steps. Thus, at the point $(x, y)$ itself, where $x$ additional (downward to the right) 
steps have been taken, we get 

$$f(x, y) = \frac{(x + y)(x + y + 1)}{2} + x$$

This quadratic expression is actually injective and surjective when considered with 
only natural numbers in its domain and codomain, $N \times N \xrightarrow{f} N$. The best proof 
of that is to recursively define the inverse map $N \xrightarrow{\langle x, y \rangle} N \times N$ whose components 
are often called “pairing functions”. 
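
The enumeration and its inverse can be checked on small cases. The sketch below is an illustration only: `pair` implements the quadratic expression for $f$, while `step` and `unpair` are hypothetical names for one possible recursive definition of the enumeration (move down and to the right within a fiber, and jump to the top of the next fiber when the bottom is reached).

```python
def pair(x, y):
    """f(x, y): number of steps taken by the path before reaching (x, y)."""
    n = x + y
    return n * (n + 1) // 2 + x


def step(point):
    """Advance one step along the enumeration path."""
    x, y = point
    if y == 0:
        return (0, x + 1)       # bottom of a fiber: go to the top of the next one
    return (x + 1, y - 1)       # otherwise move downward to the right


def unpair(k):
    """The k-th point of the path, defined by recursion on k."""
    point = (0, 0)
    for _ in range(k):
        point = step(point)
    return point
```

Each application of `step` raises the value of `pair` by exactly one, which is the reason the two maps are mutually inverse.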


Exercise 9.12 

Show that for any set $A$ equipped with a commutative and associative operation 
$A \times A \xrightarrow{+} A$, with zero $1 \xrightarrow{0} A$, there is a map 

$$A^N \xrightarrow{\ \Sigma\ } A^N$$

that assigns to any sequence $a$ its sequence of partial sums 

$$(\Sigma a)_n = \sum_{k=0}^{n-1} a_k$$

(note that $(\Sigma a)_0 = 0$ independent of $a$). ◊
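
As a numerical companion to the exercise (not a solution of it, which asks for a map obtained from the axioms), the partial-sum operation can be sketched in Python. Sequences are modeled as functions on nonnegative integers; the name `partial_sums` and its parameters are hypothetical.

```python
def partial_sums(a, add, zero):
    """Return the sequence n -> a(0) + ... + a(n - 1), with value zero at n = 0."""
    def sigma(n):
        total = zero
        for k in range(n):
            total = add(total, a(k))
        return total
    return sigma


# Example sequence a_k = k + 1 with ordinary addition (illustrative choice).
sums = partial_sums(lambda k: k + 1, lambda x, y: x + y, 0)
```

For this example sequence the values of `sums` are the triangular numbers that count the steps to the top of each fiber in the enumeration of $N \times N$.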


The verification that, for example, the composition of two sequences defined 
by recursion is equal to another given sequence defined by recursion can often 
be done just by verifying that the effective recursion data are the same. But more 
generally, such equality statements are usually proved by the method known as 
induction. Note that the equalizer of two sequences is a part of N; therefore the 
usual statement of induction involves parts of N rather than equations. 

Theorem 9.13: (also known as Peano’s Postulate). If $A$ is any part of $N$, one can 
conclude that $A$ is actually the whole of $N$, $A \cong N$, provided one can show that 
both 

$$0 \in A \quad \text{and}$$
$$\forall n\,[n \in A \Rightarrow n + 1 \in A]$$

Proof: We need to recursively define an inverse $x : N \to A$ for the inclusion $i_A$ 
of $A$ in $N$. By the second part of the hypothesis there is a map $A \xrightarrow{\sigma_A} A$ for which 
$i_A \sigma_A = \sigma i_A$. Since, moreover, $0 \in A$, there is by recursion a unique $N \xrightarrow{x} A$ for 
which 

$$x0 = 0$$
$$x\sigma = \sigma_A x$$

Then, 

$$i_A x 0 = 0$$
$$i_A x \sigma = i_A \sigma_A x = \sigma i_A x = \sigma(i_A x)$$

Therefore, 

$$i_A x = 1_N$$

as the unique solution of the recursion problem whose solution is $1_N$. Furthermore, 
we have $i_A(x i_A) = (i_A x) i_A = 1_N i_A = i_A 1_A$, and so $x i_A = 1_A$ since $i_A$ is monic. 
Combining these shows that $i_A : A \cong N$, as claimed. ■ 


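Now the equation 

[Diagram: the inclusion $A \xrightarrow{i_A} N$ together with the endomaps $\sigma_A$ of $A$ and $\sigma$ of $N$.] 

$$i_A \sigma_A = \sigma i_A, \qquad i_A \text{ monic}$$

means that $A^{\circlearrowright \sigma_A}$ is a part of $N^{\circlearrowright \sigma}$ in the sense of the category $\mathcal{S}^{\circlearrowright}$ (see Exercise 
9.18). The induction theorem says that if such a part contains the element 0 in its 
underlying abstract set, then it can only be the whole $N$. But how many parts of $N$ 
(in the sense of $\mathcal{S}^{\circlearrowright}$) are there altogether? There is of course the empty part. But, 
moreover, given any element of $N$, for example 5, the representing map 

$$N^{\circlearrowright \sigma} \to N^{\circlearrowright \sigma}$$

is a part of $N$ (also in $\mathcal{S}^{\circlearrowright}$) whose members are 5, 6, 7, 8, .... It should be clear 
that there are no other parts of $N$ in $\mathcal{S}^{\circlearrowright}$ (even though in $\mathcal{S}$ there is a huge number 
$2^N$ of parts of $N$). This knowledge of the structure of parts of $N$ will be essential 
in understanding the nature of a different remarkable variable set $P_N(1)$, the set of 
truth values for $\mathcal{S}^{\circlearrowright}$. 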


Exercise 9.14 

Determine the classifier $P_N(1)$ for sub dynamical systems. 

Hint: Let 0 stand for “true” (“already true”) and $\infty$ for “false” (i.e., “will never 
become true”) and remember that the indicator map $X \to P_N(1)$ of any part $A$ 
of any $X$ must be a map in $\mathcal{S}^{\circlearrowright}$, whereas the internal dynamic of $P_N(1)$ must be 
given once and for all independently of $X$. Show that every map $N \to P_N(1)$ has 
finite image. ◊


9.4 Additional Exercises 

Exercise 9.15 

Why does the statement $1 + N \cong N$ imply that successor is not surjective? 

Exercise 9.16 

Show that the addition mapping $N \times N \xrightarrow{+} N$ satisfies 

$$(n + m) + p = n + (m + p)$$

$$n + m = m + n$$
$$n + p = m + p \Rightarrow n = m$$


Exercise 9.17 

Show that the multiplication mapping $N \times N \xrightarrow{\ \cdot\ } N$ satisfies 

$$(n \cdot m) \cdot p = n \cdot (m \cdot p)$$

$$n \cdot m = m \cdot n$$

$$n \cdot p = m \cdot p \Rightarrow n = m \text{ or } p = 0$$


Exercise 9.18 

Identify the monomorphisms in $\mathcal{S}^{\circlearrowright}$. 

Exercise 9.19 

Show that $\mathcal{S}^{\circlearrowright}$ has products and equalizers (and hence all finite limits). Show that 
$\mathcal{S}^{\circlearrowright}$ has all finite colimits. 

Exercise 9.20 

Show that $\mathcal{S}^{\circlearrowright}$ has exponential objects. Notice that this exercise combined with 
the preceding exercise and Exercise 9.14 implies that $\mathcal{S}^{\circlearrowright}$ is a topos. 

Exercise 9.21 

Show that there is an object in $\mathcal{S}/X$ that has an element 0 and an endomorphism $\sigma$ 
(both in the sense of $\mathcal{S}/X$) that satisfies the Dedekind-Peano axiom. 

Hint: The object is constant as a family, that is, as a mapping to X it is a projection. 

Exercise 9.22 

Show that S has a natural number object. 

Hint: The object has an identity as its transition arrow. 

Exercise 9.23 

Show that the category of M-sets has a natural number object. 

Hint: The object has trivial action by M. 

Exercise 9.24 

Show that $\mathcal{S}^{\circlearrowright}$ has a natural number object. 

Hint: The object has an identity as its structural endomorphism. 

Exercise 9.25 

(a) Show that there is a (diagonal) functor (Appendix C.1) denoted $\Delta$ from $\mathcal{S}$ to 
$\mathcal{S}^{\circlearrowright}$ whose value at a set $A$ is the object of $\mathcal{S}^{\circlearrowright}$ given by $A$ with the identity 
endomorphism. (Define $\Delta$ also on mappings and show it satisfies the equations.) 

(b) Show that there is a functor Fix from $\mathcal{S}^{\circlearrowright}$ to $\mathcal{S}$ that sends an object of $\mathcal{S}^{\circlearrowright}$ 
to its set of fixed points (see Exercise 9.3). 

These two functors have the following important relationship: 

(c) Show that for any set $A$ in $\mathcal{S}$ and object $X$ in $\mathcal{S}^{\circlearrowright}$ there is a one-one correspondence 
between mappings in $\mathcal{S}^{\circlearrowright}$ from $\Delta(A)$ to $X$ and mappings in $\mathcal{S}$ from $A$ 
to Fix$(X)$. ($\Delta$ is left adjoint - see Appendix C.1 - to Fix.) 

10.1 Monoids, Posets, and Groupoids 

The section title names three very special classes of categories that arise frequently. 
Knowledge of these categories is helpful in analyzing less special situations. 

Definition 10.1: A monoid is a category with exactly one object. 

Hence any two maps in a monoid $\mathcal{A}$ can be composed; this composition is 
associative and has a two-sided neutral element usually called 1. A monoid $\mathcal{A}$ is 
often said to arise concretely if we are given some large category $\mathcal{E}$ (of structures 
or spaces), if we choose an object $A$ of $\mathcal{E}$, and if we define $\mathcal{A}$ to consist of all 
$\mathcal{E}$-endomaps of $A$. 

Thus, for example, all linear endomaps of the particular linear space $A = \mathbb{R}^3$, 
i.e. all $3 \times 3$ real matrices, constitute a monoid $\mathcal{A}$, where composition is matrix 
multiplication. Here $\mathcal{E}$ could be the concrete category of all $\mathbb{R}$-linear spaces and 
maps; hence, $\mathcal{A}$ can be considered to arise concretely in the preceding sense. We 
will study in more detail in Section 10.3 another example, the concrete monoid 
of all endomaps (in the sense of the concrete category $\mathcal{E} = \mathcal{S}$ of abstract sets and 
arbitrary maps) of the particular two-element set $A = 2$; the resulting monoid $\mathcal{A}$ 
has exactly four maps in it, one of which is a non-identity involution ($\tau\tau = 1$) and 
two of which are constant ($cx = c$ for all $x$ in $\mathcal{A}$); moreover, 

$$\tau c_0 = c_1, \qquad \tau c_1 = c_0$$

where $c_0, c_1$ are the two constants. 
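
The four-element monoid is small enough to enumerate. The following Python sketch is offered only as an illustration: an endomap of the two-element set is stored as the tuple of its values, and the names `endomaps`, `compose`, `identity`, `tau`, `c0`, `c1` are hypothetical labels chosen for this example.

```python
from itertools import product

# An endomap of the set 2 = {0, 1} is stored as (value at 0, value at 1).
endomaps = list(product((0, 1), repeat=2))


def compose(g, f):
    """Return the composite g f, i.e. first f and then g."""
    return tuple(g[f[x]] for x in (0, 1))


identity = (0, 1)
tau = (1, 0)    # the non-identity involution
c0 = (0, 0)     # constant map with value 0
c1 = (1, 1)     # constant map with value 1
```

Composing with these tuples confirms the relations stated above, for instance that `compose(tau, c0)` is `c1` and that `compose(c0, x)` is `c0` for every `x` in the monoid.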

Definition 10.2: In a category (not necessarily a monoid), the notion of constant 
map $A \xrightarrow{c} B$ can be defined as follows: for every object $T$ there is a map $T \xrightarrow{c_T} B$ 
such that for all $T \xrightarrow{x} A$, $cx = c_T$. 

Note that in any category with a terminal object, a constant in the sense of 
Definition 1.25 satisfies this definition also. 

Exercise 10.3 

Show that $2 \xrightarrow{c_0} 2$ is constant in the sense of this definition even when we consider 
it within the whole category $\mathcal{S}$. ◊

On the other hand, monoids often arise non-concretely: 

Exercise 10.4 

With each real number $s$ associate “abstractly” a map $T_s$ and define 

$$1 = T_0$$
$$T_s T_t = T_{s+t}$$

Show that a monoid (i.e., a one-object category) is thus obtained. Can this monoid 
be realized concretely, as the full subcategory of all endomaps of a suitable object 
$A$ in the category of all structured spaces of some kind? ◊

Exercise 10.5 

For pairs $(a, b)$ of real numbers, define 

$$(a_2, b_2) \cdot (a_1, b_1) = (a_2 a_1, a_2 b_1 + b_2)$$

Prove that this is a one-object category. Which pair $(a_0, b_0)$ is the identity map 1 of 
this category? Can it be concretely realized? ◊


A second special class of categories is described by 

Definition 10.6: A category $\mathcal{A}$ is a poset (short for partially ordered set) if and 
only if there is at most one map from any object to any other object: 

$$A' \underset{g}{\overset{f}{\rightrightarrows}} A \implies f = g$$

We often write $A' \leq A$ (or $A' \subseteq A$ or $A' \vdash A$) to mean 

$$\exists f\,[A' \xrightarrow{f} A]$$

Sometimes it is important (though usually not) to distinguish between posets in 
general and “strict” posets in particular, the latter being those for which the further 
condition 

$$A' \leq A \text{ and } A \leq A' \implies A' = A$$

holds. 

A poset $\mathcal{A}$ is often said to arise concretely if we are given a large category $\mathcal{E}$ of 
structured spaces, choose an object $X$ in $\mathcal{E}$, and define $\mathcal{A}$ to be a category of parts 
of $X$ (in the sense of $\mathcal{E}$) with inclusions as maps. 

Exercise 10.7 

In a poset all maps are monomaps. If $\mathcal{A}$ is a category that has a terminal object and in 
which all maps are monomaps, then $\mathcal{A}$ is a poset. Give an example of a category in 
which all maps are monomaps but which is not a poset. ◊

Exercise 10.8 

In any category $\mathcal{B}$, we could define $B' \leq B$ to mean $\exists\, B' \to B$. Explain how this 
idea associates to every category $\mathcal{B}$ a poset $\mathcal{B}_{po}$ and define a functor (see Definition 
10.18) $\mathcal{B} \to \mathcal{B}_{po}$. When is this functor invertible? ◊

Exercise 10.9 

Consider triples $(x, y, r)$, where $x, y$ are arbitrary real numbers but $r$ is real with 
$r > 0$. Define $(x, y, r) \xrightarrow{f} (\bar{x}, \bar{y}, \bar{r})$ to mean that $r \leq \bar{r}$ as real numbers and $f$ is 
a nonnegative real number $f \geq 0$ for which 

$$(\bar{x} - x)^2 + (\bar{y} - y)^2 = (\bar{r} - r + f)^2$$

Define composition so as to make this a category. Is it a poset? Can it be concretely 
realized? ◊

Exercise 10.10 

There is only one category that is both a monoid and a poset, and it is a “groupoid” 
in the following sense: ◊

Definition 10.11: A category is said to be a groupoid if and only if every arrow in 
it has a two-sided inverse in the same category: 

$$\forall A', A \;\; \forall A' \xrightarrow{f} A \;\; \exists A \xrightarrow{g} A' \; [fg = 1_A \text{ and } gf = 1_{A'}]$$

Usually the name group is reserved for those groupoids that are moreover 
monoids, but frequently in applications groupoids arise that are not monoids; fortunately, 
almost all the profound theorems proved in the past 200 years for one-object 
groups extend rather effortlessly to groups with many objects. Certainly for our 
purposes, the property of having all maps invertible will be more significant as an 
indicator of a qualitative particularity of variation/motion; the accident of being 
described in terms of a single object will mainly serve the subjective purpose of 
making some problems seem simpler. One sometimes says that a groupoid $\mathcal{A}$ arises 
concretely if we are given a category $\mathcal{E}$ of structured spaces and we define $\mathcal{A}$ to be 
the category of all isomorphisms in $\mathcal{E}$ 

$$\mathcal{A} = \mathrm{Iso}(\mathcal{E})$$

that is, the objects of $\mathcal{A}$ are the same as those of $\mathcal{E}$, but the maps of $\mathcal{A}$ are only those 
maps of $\mathcal{E}$ that have 2-sided inverses in $\mathcal{E}$. 

Exercise 10.12 

$1_A$ is invertible for all $A$. If $f_1, f_2$ are invertible and composable in $\mathcal{E}$, then $f_2 f_1$ is 
invertible in $\mathcal{E}$. Hence, $\mathrm{Iso}(\mathcal{E})$ is a category. ◊

Exercise 10.13 

If $g$ is the two-sided inverse for $f$, then $g$ itself has a two-sided inverse. Hence 
$\mathrm{Iso}(\mathcal{E})$ is a groupoid. ◊

Exercise 10.14 

If we intersect the groupoid $\mathrm{Iso}(\mathcal{E})$ with the endomorphism monoid $\mathrm{Endo}_{\mathcal{E}}(A)$ of 
an object, we get 

$$\mathrm{Aut}_{\mathcal{E}}(A) = \mathrm{Iso}(\mathcal{E}) \cap \mathrm{Endo}_{\mathcal{E}}(A)$$

a one-object groupoid (called the automorphism group of $A$ in $\mathcal{E}$). ◊

Exercise 10.15 

If $A$ is a finite set in the category $\mathcal{E} = \mathcal{S}$ of abstract sets and arbitrary maps, then 

$$\mathrm{Aut}_{\mathcal{S}}(A) = A! \quad \text{(factorial)}$$

Exercise 10.16 

Show that there is a group with only three maps 

$$1, \; w, \; w^2, \qquad w^3 = 1$$

Can it be realized concretely? ◊


10.2 Actions 

10.2.1 Actions as a Typical Model of Additional Variation and 
the Failure of the Axiom of Choice as Typical 

The notion of “action” is defined for any (even abstractly given) category $\mathcal{A}$, but 
then the actions of $\mathcal{A}$ are the objects of a category of structured spaces from 
which we can extract concrete subcategories, which in turn (in their abstract 
guise) have actions. This dialectic is quite essential in all parts of mathematics 
since even if we start with the “self-action” of concrete $\mathcal{A}$, other actions of the same 
$\mathcal{A}$ immediately arise by mathematical constructions such as taking the spaces of 
open subsets, of compact subsets, of continuous functions, or of measures, on the 
original spaces. Actually, there are left actions and right actions of $\mathcal{A}$. Let us first 
consider the situation in which $\mathcal{A}$ is a given subcategory of a category $\mathcal{E}$. For any 
object $X$ of $\mathcal{E}$ we can consider the algebra $\mathcal{A}(X)$ of $\mathcal{A}$-valued functions on $X$ (in 
the sense of $\mathcal{E}$) as follows: 

(0) For each object $A$ of $\mathcal{A}$ there is a set 

$$\mathcal{A}(X)(A) = \mathcal{E}(X, A)$$

defined in this case as the set of all $\mathcal{E}$-maps $X \to A$. 

(1) For each element $f$ of the set $\mathcal{A}(X)(A)$ and for each map $A \xrightarrow{\alpha} A'$ of $\mathcal{A}$ 
there is an associated element $\alpha f$ of the set $\mathcal{A}(X)(A')$, defined in this case as 
composition. 

(2) $1_A f = f$. 

(3) $(\alpha'\alpha)f = \alpha'(\alpha f)$ in $\mathcal{A}(X)(A'')$ whenever $A \xrightarrow{\alpha} A' \xrightarrow{\alpha'} A''$ in $\mathcal{A}$ and whenever 
$f$ is an element of the set $\mathcal{A}(X)(A)$. 

Any system of sets associated to the objects of a category $\mathcal{A}$ together with any 
system of operations between these sets associated to the maps of $\mathcal{A}$ in such a 
way as to satisfy conditions (2) and (3) is called a left $\mathcal{A}$-action, or simply a left 
$\mathcal{A}$-set, regardless of whether it is given by composition in terms of some enveloping 
category $\mathcal{E}$ and some object $X$ in it. 
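
Conditions (2) and (3) can be tested mechanically when the monoid and the set are finite. The sketch below is an illustration under stated assumptions: the category is taken to be the monoid of all endomaps of the two-element set (so there is a single object), the action is evaluation, and the names `is_left_action`, `compose`, and `act` are hypothetical.

```python
from itertools import product

points = (0, 1)
# Endomaps of {0, 1}, each stored as the tuple of its values.
monoid = list(product(points, repeat=2))
unit = (0, 1)


def compose(b, a):
    """Return the composite b a, i.e. first a and then b."""
    return tuple(b[a[x]] for x in points)


def act(a, x):
    """The concrete left action of an endomap on an element: evaluation."""
    return a[x]


def is_left_action(monoid, compose, act, points, unit):
    """Check the unit law (2) and the associative law (3) by exhaustive search."""
    unit_law = all(act(unit, x) == x for x in points)
    assoc_law = all(
        act(compose(b, a), x) == act(b, act(a, x))
        for a in monoid for b in monoid for x in points
    )
    return unit_law and assoc_law
```

Evaluation passes the check, whereas an arbitrary assignment such as sending every pair to 0 fails the unit law; the exhaustive search is only feasible because everything here is finite.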

Exercise 10.17 

The preceding description of the action of maps in $\mathcal{A}$ as operations on the function 
algebra of $X$ is not limited to unary operations because not only is $\mathcal{A}$ not necessarily 
a monoid, but it even may contain Cartesian products for some of its objects. For 
example, if $\mathcal{E}$ is the category of smooth ($C^\infty$) manifolds $X$ (e.g., open balls and 
their bounding spheres, tori, $\mathbb{R}^n$, etc.) and smooth ($C^\infty$) maps between them, we 
could take as $\mathcal{A}$ the subcategory of $\mathcal{E}$ determined by the two objects $\mathbb{R}$ and $\mathbb{R}^2$; 
then if $f_1, f_2$ are any two elements of $\mathcal{A}(X)(\mathbb{R})$ and if $\mathbb{R}^2 \xrightarrow{\theta} \mathbb{R}$ is addition or 
multiplication, then there is a uniquely determined element $f$ of $\mathcal{A}(X)(\mathbb{R}^2)$ for 
which $f_1 \theta f_2 = \theta f$ in $\mathcal{A}(X)(\mathbb{R})$. ◊

The totality of left $\mathcal{A}$-sets becomes a category $\mathcal{S}^{\mathcal{A}}$ by defining the appropriate concept 
of map between them; in this case maps $\varphi$ are often called $\mathcal{A}$-homomorphisms 
or $\mathcal{A}$-natural maps (or even $\mathcal{A}$-homogeneous maps). The crucial formula expressing 
compatibility with the action is 

$$(\alpha g)\varphi = \alpha(g\varphi)$$

for all elements $g$ of the domain of $\varphi$ and all $\mathcal{A}$-maps $\alpha$ acting on both domain and 
codomain of $\varphi$; since each of the domain and codomain is really a family of sets, 
one for each object $A$ of $\mathcal{A}$, correspondingly $\varphi$ is really a family of set maps $\varphi_A$, 
and the preceding formula is often expressed as the commutativity of the naturality 
diagram 

[Diagram: the naturality square for $\varphi$.] 

(Since left $\mathcal{A}$-sets are often “algebras” - as opposed to the “geometries” we will 
usually deal with - it would not be inappropriate here to use I. N. Herstein’s convention 
that in algebra evaluation and composition are written in the opposite order: 
thus, $(g)\varphi$ would mean the value of the homomorphism $\varphi$ at the element $g$, and $\varphi\psi$ 
would mean first $\varphi$ then $\psi$; we will not be using that notation in most contexts.) 
We can sum up the discussion with the following definitions. 

Definition 10.18: If $\mathcal{A}$ and $\mathcal{B}$ are categories, a functor $\Phi$ from $\mathcal{A}$ to $\mathcal{B}$ is an 
assignment of 

• an object $\Phi(A)$ in $\mathcal{B}$ for every object $A$ in $\mathcal{A}$ 

• an arrow $\Phi(f) : \Phi(A) \to \Phi(A')$ in $\mathcal{B}$ for every arrow $A \xrightarrow{f} A'$ in $\mathcal{A}$ 

subject to the following equations: 

• $\Phi(1_A) = 1_{\Phi(A)}$ 

• $\Phi(gf) = \Phi(g)\Phi(f)$ whenever $A \xrightarrow{f} A' \xrightarrow{g} A''$ 

Definition 10.19: The category $\mathcal{S}^{\mathcal{A}}$ of left $\mathcal{A}$-sets has as objects the functors 
$\Phi : \mathcal{A} \to \mathcal{S}$. A morphism $\varphi$ in $\mathcal{S}^{\mathcal{A}}$ from $\Phi$ to $\Phi'$ is a natural transformation, where a 
natural transformation or homomorphism $\varphi : \Phi \to \Phi'$ is a family of morphisms 
$\varphi_A : \Phi(A) \to \Phi'(A)$ satisfying 

$$\Phi'(\alpha)\varphi_A = \varphi_{A'}\Phi(\alpha) \quad \text{whenever } A \xrightarrow{\alpha} A'$$

Exercise 10.20 

If $\mathcal{A} \subseteq \mathcal{E}$ is a full subcategory and if $X \xrightarrow{\varphi} Y$ is any map in $\mathcal{E}$, then there is a 
natural homomorphism of left $\mathcal{A}$-sets 

$$\mathcal{A}(Y) \to \mathcal{A}(X)$$

between the associated function algebras defined by 

$$g \mapsto g\varphi$$

◊

Instead of trying to maintain two conventions on the order of composition (one 
for algebra and one for geometry) most authors write something like 

$$\mathcal{A}(Y) \xrightarrow{\varphi^*} \mathcal{A}(X)$$

to denote the operation of “pulling back along $\varphi$” the $\mathcal{A}$-valued functions $g$ on $Y$ to 
$\mathcal{A}$-valued functions on $X$, i.e. $\varphi^* g = g\varphi$. One can prove the functoriality formula 

$$(\psi\varphi)^* = \varphi^*\psi^*$$

very easily in this context. (The functor is objectively contravariant as $X$ varies no 
matter how we subjectively write it.) 

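Some basic examples of right $\mathcal{A}$-actions also arise from some realization of $\mathcal{A}$ 
as a concrete category and the choice of an object $X$ in an enveloping category 
$\mathcal{E} \supseteq \mathcal{A}$. We will use (for the moment) $\mathcal{G}_{\mathcal{A}}(X)$ to denote the resulting right $\mathcal{A}$-set: 

(0) For each object $A$ of $\mathcal{A}$ there is a set 

$$\mathcal{G}_{\mathcal{A}}(X)(A) \overset{\text{def}}{=} \mathcal{E}(A, X)$$

(1) For each element $x$ of $\mathcal{G}_{\mathcal{A}}(X)(A)$ and for each map $A' \xrightarrow{\alpha} A$ of $\mathcal{A}$ there is an 
associated element $x\alpha$ of $\mathcal{G}_{\mathcal{A}}(X)(A')$. 

(2) $x 1_A = x$ 

(3) $x(\alpha\alpha') = (x\alpha)\alpha'$ in $\mathcal{G}_{\mathcal{A}}(X)(A'')$ 
whenever $A'' \xrightarrow{\alpha'} A' \xrightarrow{\alpha} A$ in $\mathcal{A}$ and $x$ is an element of $\mathcal{G}_{\mathcal{A}}(X)(A)$. 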

Just as we called the concrete example $\mathcal{A}(X)$ of a left $\mathcal{A}$-set a “function algebra” 
with the action of $\mathcal{A}$ called “algebraic operations” on the functions, and natural 
homogeneous maps “homomorphisms”, so we could call $\mathcal{G}_{\mathcal{A}}(X)$ the “geometry 
of figures” in $X$ with the action of $\mathcal{A}$ expressing “incidence relations” among the 
figures $x, x'$; the natural homogeneous maps could be called “$\mathcal{A}$-smooth”. Note that 
if $x$ is a part of $X$ and if $x'$ is a given figure in $X$, then there is at most one $\alpha$ such that 
$x' = x\alpha$, in which case we could just write $x' \to x$ or $x' \in x$ or $x' \subseteq x$ to describe 
the incidence situation. However, in general the description of an incidence situation 
will involve specifically naming elements $\alpha$ of the set $\mathcal{G}_{\mathcal{A}}(X)(x', x)$ of maps in $\mathcal{A}$ 
for which $x' = x\alpha$. This is because in general it is necessary to consider “singular” 
figures $x$ (i.e., those that are not parts) to do justice to the geometry $\mathcal{G}_{\mathcal{A}}(X)$ of $X$, as 
seen from $\mathcal{A}$ in $\mathcal{E}$. For example, $A$ itself may be shaped as a closed interval $[0, 1]$, 
$X$ may be a plane, and “$\mathcal{A}$-smooth” may mean exactly “continuous”. The figures in 
$\mathcal{G}_{\mathcal{A}}(X)(A)$ are continuous curves in the plane, including many that are parts of the 
plane but also many loops $x$ for which $x(0) = x(1)$ and hence $x$ is “singular”! This 
latter is actually an incidence relation, for we can include in our little category $\mathcal{A}$ also 
the space $A'$ consisting of one point. A figure of type $A'$ in $X$ is then just a specified 
point $x'$ of the plane. There are two maps $A' \underset{\alpha_1}{\overset{\alpha_0}{\rightrightarrows}} A$ of particular interest here. 

Exercise 10.21 

Define $\alpha_0, \alpha_1$ so that the incidence relations in $\mathcal{G}_{\mathcal{A}}(X)$ 

$$x\alpha_0 = x'$$
$$x\alpha_1 = x'$$

are equivalent to saying that the curve $x$ is a loop at the point $x'$. ◊

Exercise 10.22 

If $\mathcal{A} \subseteq \mathcal{E}$ and $X \to Y$ is an arrow of $\mathcal{E}$ show that there is an induced $\mathcal{A}$-smooth 
map of right $\mathcal{A}$-sets: 

$$\mathcal{G}_{\mathcal{A}}(X) \to \mathcal{G}_{\mathcal{A}}(Y)$$

◊

Exercise 10.23 

Let $\mathcal{A}^{op}$ be the opposite category for $\mathcal{A}$, i.e., the category with the same objects as 
$\mathcal{A}$, but with $\mathcal{A}^{op}(A, A') = \mathcal{A}(A', A)$ and composition derived from $\mathcal{A}$'s. Show that 
a right $\mathcal{A}$-set is the same thing as a left $\mathcal{A}^{op}$-set and further that the category of right 
$\mathcal{A}$-sets and all $\mathcal{A}$-smooth maps is exactly $\mathcal{S}^{\mathcal{A}^{op}}$. ◊

Briefly, for right $\mathcal{A}$-sets the action of $\mathcal{A}$ is contravariant, whereas the basic induced 
$\mathcal{A}$-natural maps are covariant (opposite to the situation for left $\mathcal{A}$-sets). 

Of both left and right $\mathcal{A}$-sets there is a supply of especially fundamental examples 
whose construction does not require imagining a concrete enveloping category $\mathcal{E}$, 
namely, we can consider $\mathcal{A}(X)$ and $\mathcal{G}_{\mathcal{A}}(X)$, where $X$ is an object of $\mathcal{A}$ itself. These 
examples are usually referred to by names like “representable functors,” “regular 
representation,” or “self-actions by translation”. Of course, if $\mathcal{A}$ is a monoid this 
supply of examples is limited to one, even though the categories of left $\mathcal{A}$-sets and 
right $\mathcal{A}$-sets are still quite vast and varied. But the representables are very useful 
probes in studying the general actions. 

Exercise 10.24 

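(Yoneda’s lemma) For any category $\mathcal{A}$, let $C$ be an object of $\mathcal{A}$ and let $\Phi$ be any 
right $\mathcal{A}$-set, i.e. a family of sets parameterized by objects of $\mathcal{A}$ and a family of 
set-maps between these contravariantly parameterized by the maps of $\mathcal{A}$, satisfying 
the associative and unit laws 

$$x1 = x$$

$$x(\alpha\alpha') = (x\alpha)\alpha'$$

Then any $\mathcal{A}$-smooth map 

$$\mathcal{G}_{\mathcal{A}}(C) \xrightarrow{\varphi} \Phi$$

is actually 

$$\varphi = \bar{x}$$

for a uniquely determined element $x$ of $\Phi(C)$ (figure in $\Phi$ of type $C$). Here $\bar{x}(c) \overset{\text{def}}{=} xc$ 
for any $A \xrightarrow{c} C$. Further, the action in $\Phi$ can be seen as composition of $\mathcal{A}$-smooth 
maps: $\overline{x\alpha} = \bar{x}\,\bar{\alpha}$ for $C' \xrightarrow{\alpha} C$. ◊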

Exercise 10.25 

(Cayley) Every abstract monoid $\mathcal{A}$ can be realized concretely. 

Hint: Take $\mathcal{E} = \mathcal{S}^{\mathcal{A}^{op}}$, and take for $A$ the (essentially unique) representable right 
$\mathcal{A}$-set; apply Yoneda’s lemma with $\Phi = C = A$. ◊

Exercise 10.26 

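(Dedekind, Hausdorff) Every abstract poset $\mathcal{A}$ can be realized concretely as 
inclusions. 

Hint: Take $\mathcal{E} = \mathcal{S}^{\mathcal{A}^{op}}$ and consider the particular object 1 such that 

$$1(A) = \text{a one-element set for all } A \text{ in } \mathcal{A}.$$

Then there is a unique part 

$$\mathcal{G}_{\mathcal{A}}(C) \subseteq 1$$

in $\mathcal{E}$ for each $C$ in $\mathcal{A}$, and by Yoneda 

$$\frac{B \to C \quad \text{in } \mathcal{A}}{\mathcal{G}_{\mathcal{A}}(B) \subseteq \mathcal{G}_{\mathcal{A}}(C) \quad \text{as parts of } 1 \text{ in } \mathcal{E}}$$

If $\Phi$ is an arbitrary right $\mathcal{A}$-set, then the action of $B \to C$ on an element $x$ of $\Phi$ at 
$C$ is represented as composition (in $\mathcal{E} = \mathcal{S}^{\mathcal{A}^{op}}$) with the inclusion 

[Diagram: the inclusion of $\mathcal{G}_{\mathcal{A}}(B)$ in $\mathcal{G}_{\mathcal{A}}(C)$ composed with $x$.] 

(composition of $x$ with an inclusion is in general often called “restriction” of $x$ 
from the larger part to the smaller). ◊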

Exercise 10.27 

The action by restriction along an inclusion is not necessarily either surjective or 
injective. 

Hint: Consider the category $\mathcal{A} = [U \to C]$ consisting of only two objects and 
only one nonidentity map. It is a poset. Let $\Phi(U)$ = the set of all continuous real-valued 
functions on the open interval $(0, 1)$, but $\Phi(C)$ = the set of all real functions 
continuous on the closed interval $[-1, 1]$. Show that the single restriction process 
$\Phi(C) \to \Phi(U)$ is neither injective nor surjective. ◊

Exercise 10.28 

If $\mathcal{A}$ is just a set of objects (no nonidentity maps), then 

$$\mathcal{S}^{\mathcal{A}^{op}} = \mathcal{S}/\mathcal{A}.$$

◊


10.3 Reversible Graphs 

Let $\mathcal{A}$ be the concrete four-element monoid obtained as endomaps of the two-element 
set in the category of sets. The concrete action of $\mathcal{A}$ on 2 is a left action, 
and indeed the discussion of “winning teams” in Proposition 8.22 and Exercise 8.24 
concerns function algebras and homomorphisms relative to this category. However, 
we want here to describe instead some examples of right actions of this same 
$\mathcal{A}$. In this case we can visualize the right $\mathcal{A}$-sets as (reversible) “graphs” in the 
following way. Suppose $X$ is a set equipped with a right action of $\mathcal{A} = \{1, \tau, c_0, c_1\}$. 
Since 

$$c_i c_j = c_i$$

holds in $\mathcal{A}$, one can show that there is a part $X_0$ of the underlying set of $X$ that can 
be described in several equivalent ways: 

Exercise 10.29 

Each of the following four conditions on an element $x$ of $X$ implies the others 

$$xc_0 = x, \qquad xc_1 = x$$

$$\exists x'[x'c_0 = x], \qquad \exists x'[x'c_1 = x]$$

◊

Let $X_0 \xrightarrow{i} X$ be the part of $X$ whose members are all these special elements; 
then we have 

[Diagram: the inclusion $X_0 \xrightarrow{i} X$ together with the endomap $\tau^*$ of $X$.] 

where $\tau^*$ is the map defined by the action 

$$\tau^* x = x\tau$$

Moreover, there exists a unique pair of maps $d_0, d_1$ as in 

[Diagram: the inclusion $X_0 \xrightarrow{i} X$ together with the maps $d_0, d_1$ from $X$ to $X_0$.] 

such that $id_0 = c_0^*$, $id_1 = c_1^*$, where $c_k^* x = xc_k$ as with $\tau$. Then 

$$d_0\tau^* = d_1, \qquad d_1\tau^* = d_0$$

The elements of $X_0$ are often called “nodes” or “vertices”. 
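
These constructions can be computed explicitly for the self-action of the four-element monoid, i.e. the monoid acting on itself from the right by composition. The Python sketch below is an illustration only; endomaps of $\{0, 1\}$ are stored as value tuples, and the names `right_act`, `nodes`, `source`, `target`, and `reverse` are hypothetical.

```python
from itertools import product

# Endomaps of {0, 1} as value tuples; these are the elements of the monoid.
monoid = list(product((0, 1), repeat=2))
one, tau, c0, c1 = (0, 1), (1, 0), (0, 0), (1, 1)


def right_act(x, a):
    """The element x a: the composite 'first a, then x'."""
    return tuple(x[a[t]] for t in (0, 1))


# Nodes are the elements fixed by c0 (equivalently, by c1).
nodes = [x for x in monoid if right_act(x, c0) == x]

# d0 and d1 send an element to its two endpoints; tau reverses it.
source = {x: right_act(x, c0) for x in monoid}
target = {x: right_act(x, c1) for x in monoid}
reverse = {x: right_act(x, tau) for x in monoid}
```

In this example the nodes are the two constants, the element 1 has `c0` and `c1` as its endpoints, and reversing an element with `tau` exchanges its endpoints.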

Definition 10.30: For $\mathcal{A} = \{1, \tau, c_0, c_1\}$, an element $x$ of a right $\mathcal{A}$-set is called a 
loop if and only if 

$$xc_0 = xc_1$$

An element $x$ is called a one-lane loop if and only if 

$$x\tau = x$$

Exercise 10.31 

If $x$ is a loop, then $x\tau$ is also a loop. Any one-lane loop is a loop (but not conversely), 
and any member of $X_0$ is a one-lane loop (but not conversely). ◊

The preceding calculations suggest a good way to picture an $\mathcal{A}$-set $X$: 

Definition 10.32: Say that $a \xrightarrow{x} b$ in $X$ if and only if 

$$xc_0 = a \text{ and } xc_1 = b$$

(Caution: $X$ is not usually a category since we are not given any composition rule, 
and hence $x$ is not in itself a map since we cannot apply it to anything - on the other 
hand, we will sometimes consider various ways of enlarging $X$ to get a category.) 

Thus, we can picture any right $\mathcal{A}$-set as a reversible graph. The general elements 
of $X$ are pictured as arrows that connect definite nodes as specified by the action of 
$c_0, c_1$. For a node $a$ it is convenient to picture the one-lane loop $ia$ as an arrow that 
is collapsed to a point, whereas all other arrows (including other loops and even 
other one-lane loops) retain length. Since 


it is convenient to draw $x$, $x\tau$ next to one another like the two lanes of a two-lane 
highway. Thus, 

[Figure: a reversible graph with arrows labeled “leave”, “enter”, and “foray”.] 

is a very important example of a five-element right $\mathcal{A}$-set that has no two-lane loops. 
[The representable right $\mathcal{A}$-set, the self-action, gets instead the following picture 

[Figure: the nodes $c_0$ and $c_1$ joined by the arrows 1 and $\tau$.] 

since $\tau c_1 = c_0$ and $\tau c_0 = c_1$.] 

The five-element example will be seen to consist of the “truth values” for the 
whole category $\mathcal{S}^{\mathcal{A}^{op}}$. 

Exercise 10.33 

There is a unique $\mathcal{A}$-smooth (i.e., natural homogeneous) map $\varphi$ from the representable 
right $\mathcal{A}$-set into the truth-value graph for which 

$$\varphi(1) = \text{enter}$$

There is another one $\psi$ uniquely determined by 

$$\psi(1) = \text{foray}$$

$\varphi$ is injective and hence a part in $\mathcal{S}^{\mathcal{A}^{op}}$, but $\psi$ is not injective on either nodes or 
arrows. ◊


Exercise 10.34 

Define (by a table) an action of $\mathcal{A}$ on a set of the right size so that the resulting 
$\mathcal{A}$-set can be pictured as the graph 



and do the same for 


0 


Exercise 10.35 

Define a surjective $\mathcal{A}$-smooth map $p$ between the two $\mathcal{A}$-sets above. 
Determine all $\mathcal{A}$-smooth maps $s$ in the opposite direction and show that 

$$ps \neq id$$

for all of them. ◊

Exercise 10.36 

Define a surjective $\mathcal{A}$-smooth map $p$ “from the interval to the loop” 

[Figure: the interval graph mapping onto the loop graph.] 

Determine all $\mathcal{A}$-smooth $s$ in the opposite direction and show that $ps \neq id$ for all 
such $s$. ◊

The last two exercises illustrate that the failure of the axiom of choice is typical 
for variable or cohesive sets. 

10.4 Chaotic Graphs 

Several different kinds of structures are loosely referred to as “graphs”; for example, 
we have already briefly introduced “reversible” graphs. In this section (and frequently 
later) we will call “graphs” the reflexive, not necessarily reversible graphs, 
which are just the right actions of the three-element monoid 

$$\Delta_1 = \{1, \delta_0, \delta_1\}, \qquad \delta_i\delta_j = \delta_i, \quad i, j = 0, 1$$

Exercise 10.37 

This monoid can be realized concretely as the order-preserving self-maps of the 
nontrivial two element poset. ◊

If $X$ is such a graph, then for each element $x$, $x\delta_0$ is the beginning point of the 
arrow $x$, and $x\delta_1$ is the endpoint. The points themselves are considered degenerate 
arrows, which are special loops. 

Definition 10.38: The arrow $x$ is a loop if and only if $x\delta_0 = x\delta_1$ and is degenerate 
if and only if 

$$x\delta_0 = x = x\delta_1$$

Exercise 10.39 

If $x\delta_0 = x$, then $x$ is a degenerate loop (use the multiplication table for $\Delta_1$ and the 
associativity of the action). ◊

Exercise 10.40 

Write the action table for the seven-element right $\Delta_1$-set pictured here 

[Figure: a graph with seven elements, one of its arrows labeled $x$.] 

◊

A basic example of a graph is the right action of $\Delta_1$ on itself, which is pictured 
as 

$$0 \xrightarrow{\ u\ } 1$$

with the correspondences 

$$u \sim 1, \quad 0 \sim \delta_0, \quad 1 \sim \delta_1 \quad \text{in } \Delta_1$$
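
The multiplication table of the three-element monoid and its self-action are small enough to write out. The following Python sketch is an illustration under stated assumptions: the elements are encoded as the strings `"1"`, `"d0"`, `"d1"`, and the names `multiply`, `source`, `target`, and `degenerate` are hypothetical.

```python
ELEMENTS = ("1", "d0", "d1")


def multiply(a, b):
    """Product a b in the monoid: d_i d_j = d_i, and '1' is neutral."""
    if a == "1":
        return b
    return a    # a is d0 or d1, and d_i b = d_i for every b


# Right action of the monoid on itself: x acted on by a is the product x a.
source = {x: multiply(x, "d0") for x in ELEMENTS}
target = {x: multiply(x, "d1") for x in ELEMENTS}

# Degenerate arrows (points) are those fixed by both d0 and d1.
degenerate = [x for x in ELEMENTS if source[x] == x == target[x]]
```

In this encoding the element `"1"` is the single nondegenerate arrow, running from `"d0"` to `"d1"`, which matches the correspondences stated above.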


Exercise 10.41 

If $X$ is any graph and $x$ in $X$, then there is a unique map of graphs (= $\Delta_1$-smooth 
map) 

$$[0 \xrightarrow{u} 1] \xrightarrow{\ \bar{x}\ } X$$

for which $\bar{x}(u) = x$. ◊

We now want to discuss left $\Delta_1$-actions. There is a huge difference between 
the left and right categories. The graphs have the capacity to encode arbitrarily 
complicated information, and thus the complete classification of graphs is probably 
not possible. On the other hand the left-actions can be completely classified almost 
immediately. Their interest is that they will give rise to a class of right actions, 
the chaotic graphs, which will include the recipients for many graph quantities of 
interest. 

In general, if $T$ is a set with a given left action of a monoid $\mathcal{A}$, and if $V$ is any 
set, then the set 

$$T \Rightarrow V$$

of all maps from $T$ to $V$ has a natural right action of $\mathcal{A}$: 

$$(fa)(t) = f(at) \quad \text{for all } t \text{ in } T$$

which defines $fa$ as a new element of $T \Rightarrow V$ for any given element $f$ of $T \Rightarrow V$. 
The argument $at$ (at which $f$ is evaluated to get the value at $t$ of $fa$) is given by 
the presupposed left action on $T$ of $a$ in $\mathcal{A}$. 

Exercise 10.42 

For an iterated action $f(a_1 a_2) = (fa_1)a_2$ since both sides have the same value at 
each $t$ in $T$. ◊

Now in particular a left action on $T$ of the three-element monoid $\Delta_1$ means 

$$\delta_i\delta_j t = \delta_i t \quad \text{for } i, j = 0, 1$$

Exercise 10.43 

If $\delta_0 t = \delta_0 s$, then $\delta_1 t = \delta_1 s$ and conversely. ◊

Hence, the orbits of a left action are just determined by either of the two equations 
in the exercise, and each of $\delta_0$, $\delta_1$ is constant on each orbit. In other words, a left 
action with just one orbit is specified by an arbitrary set and a chosen pair $t_0, t_1$ of 
elements, with 

$$\delta_0 t = t_0$$
$$\delta_1 t = t_1$$

for all $t$ in the orbit. 

(Recall that the basic example of a left $\Delta_1$ action involved two constant maps, 
but the equations cannot force that to be taking place only on a two-element set.) 
The most general left $\Delta_1$-set is just a disjoint sum (no interaction between the 
summands) of summands, each of which is a left $\Delta_1$-set in its own right but with 
the special property of having a single orbit. Since 

k k 

is a general result, for our present purpose it will suffice to consider the case of a 
single orbit. However, 

Exercise 10.44 

If π₀(T) denotes the set of orbits (components) of a left Δ₁-set T, then the action 
of δ₀, δ₁ gives rise to two sections of the canonical surjection T → π₀(T): 

        T → π₀(T), with two sections π₀(T) ⇉ T 

The orbits are the fibers; each of these contains two labeled points t₀, t₁, but in 
some fibers the labeled points may coincide. An arbitrary left Δ₁-set is described 
up to isomorphism by giving the set I of components (I = π₀(T) above), for each 
i in I giving the number N_i of elements in that fiber, and specifying the part I₁ of 
the components in which t₀ = t₁ (rather than t₀ ≠ t₁) is to hold; these are restricted 
only by I₁ ⊆ I and 

        N_i ≥ 1 for all i in I 
        N_i ≥ 2 for all i not in I₁ 



(I itself can be 0.) 

Returning to the construction of the chaotic graph T ⇒ V with a given set V of 
“values” and a given left Δ₁-set T with one component: the arrows are arbitrary 
functions f from T to V, and 

        (fδ₀)(t) = f(δ₀t) = f(t₀) 
        (fδ₁)(t) = f(δ₁t) = f(t₁) 

define fδ₀ and fδ₁ as constant functions. 

Exercise 10.45 

The points of T ⇒ V are all the constant functions. Identifying these points with 
the elements of V, the starting and ending points of any f are identified with the 
values at t₀ and t₁ of f, respectively. ◊ 
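For a finite case the chaotic graph can be enumerated outright. This sketch assumes a one-orbit left action on a three-element set with labeled points "t0" and "t1" and a two-element value set; the helpers `arrows`, `face`, `points`, and `arrows_between` are illustrative names only. A point is taken to be an arrow fixed by both faces.

```python
from itertools import product

T = ["t0", "t1", "t2"]   # one orbit: d0 is constantly "t0", d1 is constantly "t1"
V = [0, 1]


def arrows():
    """All functions T -> V, each stored as a dict."""
    return [dict(zip(T, values)) for values in product(V, repeat=len(T))]


def face(f, i):
    """f . d_i: the constant function whose value is f(t_i)."""
    value = f["t0" if i == 0 else "t1"]
    return {t: value for t in T}


def points():
    """Arrows f with f . d0 == f == f . d1."""
    return [f for f in arrows() if face(f, 0) == f == face(f, 1)]


def arrows_between(v0, v1):
    """Arrows whose start is v0 and whose end is v1."""
    return [f for f in arrows() if f["t0"] == v0 and f["t1"] == v1]
```

In this example the value at the unlabeled element "t2" is unconstrained, so there are as many arrows between two given points as there are elements of V.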


The intuitive reason for calling T ⇒ V “chaotic” is that, for given points v₀, v₁, 
any function f : T → V for which f(t₀) = v₀ and f(t₁) = v₁ is considered admissible 
as an arrow from v₀ to v₁. One way of relating these very special graphs to 
more general, more highly structured ones, is indicated as follows: 


Exercise 10.46 

If X is any graph and if R is the chaotic graph R = T ⇒ V derived from the value 
set V and the left Δ₁-set of one component given by t₀, t₁ : 1 ⇉ T, then a graph map 
f : X → R is determined by specifying a value f(x) in V for each point x of X and a 
function f_x : T → V for each arrow x of X subject to these conditions: 

(1) if x : a → b in X then 

        t = t₀ ⇒ f_x(t) = f(a) 
        t = t₁ ⇒ f_x(t) = f(b) 

(2) If x = xδ₀ = xδ₁ = a is a degenerate arrow of X, then f_x is the constant 
function f_x(t) = f(a) for all t in T. 


Among the chaotic graphs T ⇒ V are the codiscrete ones determined by an 
arbitrary set V and by the extremely special two-element left Δ₁-set T in which 
t₀ ≠ t₁. In other words the codiscrete graph on V points has as its arrows all ordered 
pairs of elements of V with 

        (v₀, v₁)δ_i = (v_i, v_i) for i = 0, 1 

That is, the projection and diagonal structure on the cartesian product V × V 
determines the codiscrete graph structure. The universal mapping property becomes 
simply the following: 

Proposition 10.47: If V̲ is the codiscrete graph with V points and if X is any 
graph, then any graph map 

        X → V̲ 

is entirely determined by a map X₀ → V of sets, where X₀ is the set of points of 
X. That is, the restriction 

        (X ⇒_{Δ₁^op} V̲) → (X₀ ⇒ V) 

is an isomorphism of abstract sets. 

Proof: No matter how many arrows x : a → b there are in X, f(x) can only be the 
arrow (f(a), f(b)) in V̲, independently of x. ■ 

Corollary 10.48: There is a graph 2 such that for any graph X, the graph maps 
X → 2 are in one-to-one correspondence with the arbitrary subsets of the set of 
points (vertices) of X. 

Proof: The set 2 of two elements is the truth-value object for the category of abstract 
sets. So consider the codiscrete graph 2 with four arrows and two points, and apply 
Proposition 10.47. ■ 
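The counting content of the proposition and its corollary can be checked by brute force on a small example. The sketch below assumes a finite reflexive graph presented by its vertices and its nondegenerate arrows (the degenerate loops are left implicit, since a graph map must send the loop at a to the pair (f(a), f(a))); the function name is hypothetical. It enumerates every assignment of vertices to V and of arrows to ordered pairs, keeps those compatible with source and target, and counts them.

```python
from itertools import product


def count_graph_maps_to_codiscrete(vertices, arrows, V):
    """Count graph maps from a graph into the codiscrete graph on V.

    vertices: list of vertex names
    arrows:   list of (source, target) pairs, parallel arrows allowed
    """
    pairs = list(product(V, repeat=2))   # arrows of the codiscrete graph
    count = 0
    for vertex_values in product(V, repeat=len(vertices)):
        on_vertices = dict(zip(vertices, vertex_values))
        for arrow_values in product(pairs, repeat=len(arrows)):
            # Compatibility: the image of x : a -> b must be (f(a), f(b)).
            if all(image == (on_vertices[s], on_vertices[t])
                   for (s, t), image in zip(arrows, arrow_values)):
                count += 1
    return count
```

With V a two-element set the count equals the number of subsets of the vertex set, whatever arrows the graph has.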

One can similarly “classify” sets of arrows (not just of vertices). 

There is a completely different two-element left Δ₁-set T, namely, the one in 
which δ₀t = δ₁t for all t; in other words t₀ = t₁, but there is one other element t 
that is not a value of either δ₀ or δ₁. 

Exercise 10.49 

For the left Δ₁-set T just described, the graph T ⇒ V, for any given set V, satisfies 

        T ⇒ V = V × V 

the sum of V noninteracting copies of a graph V that has one point and V loops. 
(Under the isomorphism the “name” of the degenerate loop depends on the 
summand.) ◊ 

Note that a “pointed set” can be viewed as either a graph or a left Δ₁-set. This 
can be explained as follows: 

Exercise 10.50 

If A is a commutative monoid, there exists an isomorphism of categories 

        left A-sets ≅ right A-sets 

Note: An isomorphism of categories is a functor that has an inverse, which means 
a functor in the opposite direction so that both composites of the two functors are 
identity functors. ◊ 

Exercise 10.51 

There is a commutative monoid {1, e} with two elements in which e² = e and a 
surjective homomorphism (= functor) of monoids 

        Δ₁ → {1, e} 

Hence, any left (respectively right) {1, e}-set has (via the homomorphism) a natural 
left (respectively right) Δ₁-action. ◊ 
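The homomorphism in this exercise is small enough to verify exhaustively. The sketch assumes the three-element monoid is {1, d0, d1} with d_i d_j = d_i and sends both d0 and d1 to e; the names `mul_delta1`, `mul_e`, and `h` are hypothetical.

```python
DELTA1 = ["1", "d0", "d1"]
ONE_E = ["1", "e"]


def mul_delta1(a, b):
    """Product in the assumed three-element monoid: d_i * b = d_i."""
    return b if a == "1" else a


def mul_e(a, b):
    """Product in {1, e}: the result is e unless both factors are 1."""
    return "1" if a == b == "1" else "e"


def h(a):
    """Candidate homomorphism: 1 goes to 1, and d0, d1 both go to e."""
    return "1" if a == "1" else "e"


def is_homomorphism():
    return h("1") == "1" and all(
        h(mul_delta1(a, b)) == mul_e(h(a), h(b)) for a in DELTA1 for b in DELTA1
    )


def is_commutative(elements, mul):
    return all(mul(a, b) == mul(b, a) for a in elements for b in elements)
```

The target monoid is commutative while the source is not, which is why the passage between left and right actions is available for {1, e}-sets but not for the three-element monoid directly.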

Exercise 10.52 

In the notation of the preceding exercise, any {1, e}-set is uniquely a disjoint sum 
of pointed sets. ◊ 

Exercise 10.53 

A left Δ₁-set arises by the process above from a pointed set if and only if it has a 
single orbit (of arbitrary size T), and in that orbit the two distinguished points are 
equal: t₀ = t₁. ◊ 


Exercise 10.54 

A right Δ₁-set arises by the procedure above from a pointed set if and only if it is a 
graph with precisely one point (= vertex), i.e. if and only if it is connected and all 
arrows are loops. ◊ 

Exercise 10.55 

A graph is of the form T ⇒ V, where T is a left Δ₁-set arising from a pointed set 
(as above) if and only if it is a graph consisting entirely of loops and in which every 
vertex is the site of the same number of loops, which number is moreover a power 
of the total number of vertices. More exactly, if T = E + {t₀ = t₁}, then one can 
construct an isomorphism of graphs 

        V × (V^E) ≅ T ⇒ V 
◊ 

10.5 Feedback and Control 


Can a graph (1_A, φ) of a map φ : A → B be part of a graph structure in the sense 
of a right Δ₁-set? One way of making this question definite is as follows: A right 
Δ₁-set P can equivalently be described as a pair of sets and three maps 

        π, ψ : P ⇉ A,   γ : A → P 

for which πγ = 1_A = ψγ, that is, the “reflexivity” γ is a common section for the 
“source” map π and for the “target” map ψ so that it chooses for each “state” a 
in A a particular “arrow” γa in P, which is actually a loop, i.e. an arrow from 
a to a itself. The right Δ₁-set structure on such a P is defined by 

        p · δ₀ = γπp,   p · δ₁ = γψp. 

Exercise 10.56 

Verify that these equations define a right Δ₁-set. Conversely, show that any right 
Δ₁-set P gives rise to such a diagram by defining the state set A to consist of the 
trivial arrows, i.e. those p for which p · δ₀ = p = p · δ₁. Remember that not all 
loops are trivial. ◊ 
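A finite instance makes the two directions of this exercise concrete. The data below are an invented example (two states, two identity loops, one arrow from a to b, and one nontrivial loop at a); the dictionaries `SOURCE`, `TARGET`, `REFL` play the roles of the source, target, and reflexivity maps, and the action is defined by composing with the reflexivity as in the text. The check uses the assumption d_i d_j = d_i, so that a right action must satisfy (p . d_i) . d_j = p . d_i.

```python
STATES = ["a", "b"]
ARROWS = ["id_a", "id_b", "x", "loop"]
SOURCE = {"id_a": "a", "id_b": "b", "x": "a", "loop": "a"}   # source map
TARGET = {"id_a": "a", "id_b": "b", "x": "b", "loop": "a"}   # target map
REFL = {"a": "id_a", "b": "id_b"}                            # reflexivity


def act(p, i):
    """p . d0 = refl(source(p)) and p . d1 = refl(target(p))."""
    return REFL[SOURCE[p]] if i == 0 else REFL[TARGET[p]]


def is_common_section():
    """The reflexivity is a section of both the source and the target map."""
    return all(SOURCE[REFL[a]] == a == TARGET[REFL[a]] for a in STATES)


def is_right_delta1_set():
    """Since d_i d_j = d_i is assumed, check (p . d_i) . d_j == p . d_i."""
    return all(
        act(act(p, i), j) == act(p, i)
        for p in ARROWS for i in (0, 1) for j in (0, 1)
    )


def trivial_arrows():
    """Arrows p with p . d0 == p == p . d1; these recover the states."""
    return [p for p in ARROWS if act(p, 0) == p == act(p, 1)]
```

The arrow "loop" has equal source and target but is moved by both faces, illustrating that not all loops are trivial.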

Exercise 10.57 

If φ : A → B and γ = (1_A, φ) is its graph, then on the set P = A × B we have part 
of a graph structure 

        π : A × B → A,   γ : A → A × B 

where π = projection to A since πγ = 1_A. (The map φ is recovered from its graph 
γ by following instead with the projection to B.) Show that a map ψ : A × B → A 
completes the structure of a right Δ₁-set on A × B iff ψ(a, φa) = a for all a. ◊ 

Given any ψ : A × B → A to be considered as “target,” pairing it with the 
projection π to A considered as “source,” we get a “graph” (not reflexive) in which the 
arrows are labeled by the elements of B with the property that given any b and any 
state a there is exactly one arrow in the graph with label b and source a; the target of 
that arrow is a' = ψ(a, b). The elements of B act via ψ on the states, and the graph 
pictures the possible operation of the system; imagine that if b is activated when 
the system is in state a, the system will move somehow to state a'. The element b 
may be said to maintain a state a iff a = ψ(a, b), i.e. if b labels a loop at a. The 
problem of completing the information ψ with a map φ whose graph γ = (1, φ) 
makes this graph reflexive may be a nontrivial one if the maps are to be in a highly 
structured category such as a category from linear algebra. 


Exercise 10.58 

Let A and B be vector spaces and let α : B → A be a linear map. We will consider 
some action graphs in which α is one of the ingredients. A simple example is 
essentially just the vector subtraction in A 

        ψ(a, b) = α(b) − a 

Show that, with this action graph, b maintains a iff 

        α(b) = 2a 
◊ 


Exercise 10.59 

Using a “feedback” map β : A → B to “observe” the states permits one to adjust 
the acting element b before applying α, leading to an action called feedback 
control 

        ψ(a, b) = α(b − β(a)) 

Show that b maintains a with feedback control iff 

        α(b) = (1 + αβ)a 
◊ 


Exercise 10.60 

Given two opposed linear maps, the graph defined by feedback control ψ as above 
may sometimes admit a completion to a right Δ₁-action on A × B: this requires 
another linear map φ : A → B whose graph would supply the reflexivity; in other 
words, φ would supply, for every state a, a chosen b that would succeed in 
maintaining a. Show that a map φ works that way iff 

        αφ = 1 + αβ 

Show that if solutions φ to the latter equation exist, then α is surjective (and one 
can take φ = σ + β, where σ is any chosen section of α). But note that in most 
practical examples, dim A > dim B, so that no surjective α can exist. ◊ 
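A numerical instance of the feedback-control condition can be written with plain integer arithmetic. The particular maps below are invented for illustration: A is taken to be one-dimensional, B two-dimensional, `alpha` is the sum of coordinates, `sigma` is one possible section of `alpha`, and `beta` is an arbitrary linear feedback map. The sketch checks that phi = sigma + beta picks, for each state a, an element that maintains a.

```python
def alpha(b):
    """alpha : B -> A, a linear map from pairs to numbers (assumed example)."""
    return b[0] + b[1]


def beta(a):
    """beta : A -> B, an assumed linear feedback map."""
    return (3 * a, -a)


def sigma(a):
    """A chosen section of alpha: alpha(sigma(a)) == a."""
    return (a, 0)


def phi(a):
    """Candidate reflexivity phi = sigma + beta."""
    s, f = sigma(a), beta(a)
    return (s[0] + f[0], s[1] + f[1])


def psi(a, b):
    """Feedback control: psi(a, b) = alpha(b - beta(a))."""
    f = beta(a)
    return alpha((b[0] - f[0], b[1] - f[1]))


def maintains(a, b):
    """b maintains a when psi(a, b) == a."""
    return psi(a, b) == a
```

Here dim B exceeds dim A, so a surjective alpha exists; the exercise's closing remark is that this is not the typical practical situation.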

The preceding exercise raises the question of calculating the possible reflexivity 
structures on subgraphs of a given nonreflexive graph A ⇇ P. Thus, we consider 

        [diagram: a reflexive graph Y with maps σ, τ, ρ, mapped by h_A, h_P to A ⇇ P] 

where σρ = 1_Y = τρ, πh_P = h_Aσ, ψh_P = h_Aτ. 

Exercise 10.61 

Show how to consider the diagram above as q*(Y) → X, where X is the given 
A ⇇ P and where q is the inclusion functor between finite categories 

        [diagram: the inclusion q into Δ₁] 

where Δ₁ is a category with seven maps all told, the bottom object is in two 
ways a retract of the top object, and the endomorphisms of the top are isomorphic 
to Δ₁. ◊ 

Exercise 10.62 

For every X as above there is a well-defined reflexive graph (right Δ₁-set) q_*(X) 
such that for every Y there is a bijection 

        q*Y → X 
        ───────── 
        Y → q_*X 

between (nonreflexive) graph maps above the bar and Δ₁-maps (i.e., graph maps 
preserving given reflexivities) below the bar. In particular, each way of choosing 
a reflexivity structure Y on a part of X can be indicated by factoring across the 
canonical q*q_*X → X. ◊ 

Returning to the linear construction discussed in Exercises 10.58 through 10.60, 
we point out that it also is a functor. That is, we can consider diagrams 

        A ⇄ B (maps β, α)        A' ⇄ B' (maps β', α') 

of linear maps as concrete structures of a certain abstract kind (see Exercise 10.63). 

Exercise 10.63 

Define morphisms between such structures in order to obtain a reasonable 
category V of feedback-command data. 

Hint: A morphism should consist of a pair of linear maps satisfying two 
commutativity equations. ◊ 

Exercise 10.64 

Show that there is a functor F from V to the category Aff^ of nonreflexive affine 
graphs such that 

        F(A ⇄ B) = (A ⇇ A × B), with maps π and ψ, 

with ψ(a, b) = α(b) − a as before. ◊ 

Exercise 10.65 

Given any diagram X of shape A ⇄ B (with β : A → B) in any category, consider the 
diagram B(X) of the same shape 

        B ⇄ B (one of whose maps is 1_B) 

Show that there is a natural map X → B(X) of such diagrams (one leg of which 
is 1_B). In the linear case, determine the graph F(B(X)) and explain why the 
functorially induced F(β) “labels” the arrows in the graph F(X). ◊ 

Exercise 10.66 

Linearize and functorize Exercises 10.61 and 10.62. ◊ 


10.6 To and from Idempotents 

Some constructions starting from an arbitrary given map may produce diagrams 
involving idempotent maps e = e · e = e². In most categories idempotents can be 
split, giving structures of the abstract shape 

        E = [diagram: two objects with maps p, i between them and an endomap e] 



wherein pi = 1, e = ip. Note that the category 2 = (• → •) has the property that 
functors 2 → A to any category A correspond exactly to morphisms in A. 

Exercise 10.67 

Consider the three functors 

        p, i, e : 2 → E 

corresponding to the morphisms of the same names in E. Given any σ in S with 
σ² = σ, each of p*(σ), i*(σ), e*(σ) = σ is a map in S considered as a left 2-set. 
Given any f : X → Y in S considered as a left 2-set M, there are thus six sets 
equipped with idempotents that are universal with respect to maps to or from M. 
Show that these adjoints give specifically 

        p_!M = cograph of f 
        p_*M = kernel pair (equivalence relation) of f 
        i_!M = cokernel pair of f, gluing two copies of Y along X 
        i_*M = graph of f 

all equipped with a natural idempotent, whereas 

        e_!M = cograph of f but with different adjunction map than p_!M 
        e_*M = graph of f but with different adjunction map than i_*M. 

Determine also the other two associated canonical adjunction maps. ◊ 

Exercise 10.68 

Splitting the idempotents in a left action of Δ₁ yields a diagram of the form of a 
left action of Δ₁, 

        s₀, s₁ : T ⇉ X,   p : X → T 

with the two s₀, s₁ having a common retraction p. It is sometimes said that p 
“unites the two identical opposites” since the two subobjects s₀, s₁ united in the 
one X are identical (with T) as mere objects (i.e., upon neglecting the inclusions) 
but are often “opposite” in one sense or another. Because the two are subobjects 
of the same object, we can intersect them, obtaining the subobject s₀ ∩ s₁ of X. 
Since they also have a common domain, we can consider their equalizer k. However, 
the presence of p implies that these two constructions give the same result. That is, 
the natural map j, which proves inclusion in one direction, 

        j : k → s₀ ∩ s₁ 

exists because s₀t = s₁t = x implies that x ∈ s₀ and x ∈ s₁, but using p we can 
also show that for any x in X, if 

        t₀ proves x ∈ s₀ and t₁ proves x ∈ s₁ 

then in fact t₀ = t₁ so that actually x ∈ k, thus defining an inverse for j. ◊ 


10.7 Additional Exercises 

Exercise 10.69 

In a poset A show that A ≤ A' and A' ≤ A implies A ≅ A'. 

Exercise 10.70 

From a poset A may be constructed a strict poset A_s as follows. There is an 
equivalence relation R on objects of A defined by A R A' iff A ≤ A' and A' ≤ A. 
A_s has objects given by the codomain A_s of the associated partition p_R : A → A_s. 
There is an arrow B ≤ B' in A_s iff there are A ≤ A' in A such that p_R(A) = B and 
p_R(A') = B'. 

(a) Show that A_s is a strict poset and that p_R is a functor (order-preserving). 

(b) Show that A_s satisfies a universal property: If F : A → B is a functor to a strict 
poset B, then there is a unique functor F' : A_s → B such that F' p_R = F. 

Exercise 10.71 

Show that S/X is the same as the category of left X-sets, where X is viewed as a 
category with no nonidentity arrows. Compare Exercise 10.28. 

Exercise 10.72 

Show that the category S^A has finite limits and colimits. These are all computed 
“pointwise”. For example, if Φ and Ψ are left A-sets, then their product in S^A is a 
left A-set Φ × Ψ with (Φ × Ψ)(A) = Φ(A) × Ψ(A). 

Exercise 10.73 

Show that the category S^A has a natural number object. 

Hint: All of the A-actions in that object are identity. 

Exercise 10.74 

Show that the category S^A has a truth value object Ω. This is a generalization of 
Exercise 6.15. 

Hint: As a functor Ω : A → S, the value of Ω at A in A is given by the set of 
“subfunctors” of the representable functor A(A, −). 

Exercise 10.75 

Show that the category S^A has mapping sets. 

Hint: If Φ and Ψ are left A-sets, then the functor Ψ^Φ : A → S has value at X 
given by the set of natural transformations from Φ × A(X, −) to Ψ. Write the precise 
formula for the A-action on the latter sets. 



Appendix A 

Logic as the Algebra of Parts 


A.0 Why Study Logic? 

The term “logic” has always had two meanings - a broader one and a narrower one: 

(1) All the general laws about the movement of human thinking should ultimately 
be made explicit so that thinking can be a reliable instrument, but 

(2) already Aristotle realized that one must start on that vast program with a more 
sharply defined subcase. 

The achievements of this subprogram include the recognition of the necessity of 
making explicit 

(a) a limited universe of discourse, as well as 

(b) the correspondence assigning, to each adjective that is meaningful over a whole 
universe, the part of that universe where the adjective applies. This correspon- 
dence necessarily involves 

(c) an attendant homomorphic relation between connectives (like and and or) that 
apply to the adjectives and corresponding operations (like intersection and 
union) that apply to the parts “named” by the adjectives. 

When thinking is temporarily limited to only one universe, the universe as such 
need not be mentioned; however, thinking actually involves relationships between 
several universes. For example, the three universes (1) of differential equations, 
(2) of functions of time, and (3) of formal power series are all distinct with differ- 
ent classes of adjectives meaningful over each one. But there are key relationships 
between these three universes that are the everyday preoccupation of users of many 
mathematical sciences: a function might satisfy a differential equation, a power 
series might approximate a function or might formally satisfy a differential 
equation, whereas we might seek to solve a differential equation or expand a function, 
and so on. Each suitable passage from one universe of discourse to another induces 

(0) an operation of substitution in the inverse direction, applying to the adjectives 
meaningful over the second universe and yielding new adjectives meaningful 
over the first universe. 

The same passage also induces two operations in the forward direction: 

(1) one operation corresponds to the idea of the direct image of a part but is called 
“existential quantification” as it applies to the adjectives that name the parts; 

(2) the other forward operation is called “universal quantification” on the adjectives 
and corresponds to a different geometrical operation on the parts of the first 
universe. 

It is the study of the resulting algebra of parts of a universe of discourse and 
of these three transformations of parts between universes that we sometimes call 
“logic in the narrow sense”. Presentations of algebraic structures for the purpose 
of calculation are always needed, but it is a serious mistake to confuse the arbitrary 
formulations of such presentations with the objective structure itself or to arbitrarily 
enshrine one choice of presentation as the notion of logical theory, thereby obscuring 
even the existence of the invariant mathematical content. In the long run it is best 
to try to bring the form of the subjective presentation paradigm as much as possible 
into harmony with the objective content of the objects to be presented; with the 
help of the categorical method we will be able to approach that goal. 

How much logic is actually relevant to the practice of a mathematical science is 
a question we can only scientifically answer after learning a significant fragment 
of this logic. Some use of logic is essential in clarifying the successive states of 
our calculations in the everyday algebra of ordinary variable quantities. In this 
appendix we will become familiar with some instructive examples of such use after 
making explicit the main features of this logical algebra, both objectively in terms 
of parts of universes of discourse as well as subjectively in terms of the syntax and 
symbolism commonly used to present the logical algebra that organizes the naming 
of these parts. As is customary, we begin with the propositional logic of a single 
universe of discourse and then proceed to the predicate logic where that universe 
is varied; the variation of universes introduces new features even in those cases in 
which it only involves passing from a given universe to a universe of ordered pairs 
or ordered triples. 

Note: In the foregoing preliminary remarks we have used the term “adjective,” 
whereas in the remaining text of this appendix we refer usually to “statements”; the 
link between the two is expressed by the traditional term “predicate” as follows: 
Over a given universe of discourse (such as functions) an adjective (or more gener- 
ally an adjectival phrase) such as “positive” corresponds to a predicate or statement 
form such as “is positive,” which can in turn become many particular statements 
such as 

        1 + x² is positive 
        e^x is positive 

and so on; each thing or family of things (in the universe of discourse) for which 
the resulting statement is true belongs to the part of the universe (e.g., the positive 
functions) the adjective specifies. 


Historical Note 

It is sometimes objected that logic is allegedly not algebra since - for example - 
nobody thinks in cylindric algebra. That is an unfortunate misunderstanding: cylin- 
dric algebras (and polyadic algebras) were an important initial attempt in the 1950s 
to make explicit the objective algebra briefly described above; however, they were 
excessively influenced by the styles of subjective presentation (of logical content) 
that had become traditional since the 1930s under the name of “first order predicate 
calculus” or the like. Those styles of presentation involved various arbitrary choices 
(such as the specification of a single infinite list of variables) that proved to be quite 
bizarre when confronted with ordinary mathematical practice; surely the logic of 
mathematics is not cylindric algebra. For about a hundred years now, mathemat- 
ical scientists have indeed had an intuitive distrust of attempts by some logicians 
to interfere with mathematics. However, some explicit recognition of the role of 
logical algebra is helpful and even necessary for the smooth and correct learning, 
development, and use of mathematics, including even the high school algebra of 
ordinary quantities. 


A.1 Basic Operators and Their Rules of Inference 

Although the basic statements of mathematics are equations, many of the state- 
ments of interest have a somewhat more complicated structure; often this more 
complicated structure is expressed in the words that surround the equations, but 
in order to bring this structure into sharper relief it is useful to introduce a few 
“logical” symbols beyond just = (equals). These are 

        ∃, ∧, ∨, true, false (positive) 

        ∀, ⇒ (negative) 

used as operations to build up more complicated statements from simpler ones 
(usually starting from equations as the “simplest,” although of course equations 
themselves can be complicated) and the symbol ⊢ which expresses that one 
statement follows from another. (The reason for distinguishing the negative or higher 
operators from the positive ones is mentioned in Exercise A.16.) Thus if A, B are 
statements, then 

        A ∧ B 

is the statement whose truth would mean that A is true and B is true, whereas 

        A ∨ B 

is the statement whose truth would mean roughly that at least one of A or B is true; 
thus, ∧, ∨ are read simply “and,” “or”. If we express by 

        C ⊢ D 

the relation that “D follows from C” (often called “C entails D”), then the essential 
rules of inference that govern the use of ∧ are as follows: 

If C ⊢ D₁ and if C ⊢ D₂, then 

        C ⊢ D₁ ∧ D₂ 

and conversely, if C ⊢ D₁ ∧ D₂, then both 

        C ⊢ D₁ and C ⊢ D₂ 

This can be expressed more briefly as 

        C ⊢ D₁ ∧ D₂ iff C ⊢ D₁, C ⊢ D₂ 

or still more briefly as 

        C ⊢ D₁ ∧ D₂ 
        ──────────────── 
        C ⊢ D₁, C ⊢ D₂ 

where the horizontal bar means “iff” in this context, i.e. if we know of three 
statements C, D₁, D₂ that they satisfy the deducibility (or “following from”) relation 
above the line, then we can conclude that they satisfy the deducibility relations 
below the line, and conversely. Since 

        C ⊢ C 

holds for any statement, we can conclude, taking the special case where C is 
D₁ ∧ D₂, that 

        D₁ ∧ D₂ ⊢ D₁ 
        D₁ ∧ D₂ ⊢ D₂ 

for any two statements D₁, D₂, and that in turn implies the “top to bottom” half of 
the basic rule of inference above for ∧ if we use the fact that “following from” is 
transitive: 

        Whenever C ⊢ E, E ⊢ D, then also C ⊢ D. 

To analyze the “bottom to top” half of the basic two-way rule of inference for ∧ 
similarly, note that 

(a) (when we take the case in which D₁, D₂ are both C), 

        C ⊢ C ∧ C 

for any C, and also that 

(b) (exercise!) whenever C₁ ⊢ D₁, C₂ ⊢ D₂ it follows that 

        C₁ ∧ C₂ ⊢ D₁ ∧ D₂ 

Taking the case of (b), where C₁, C₂ are both C, and applying the transitivity 
of ⊢ to that together with (a), we obtain that 

        whenever C ⊢ D₁, C ⊢ D₂, then also C ⊢ D₁ ∧ D₂ 

Since we already understood what ∧ means, the detailed analysis above of its 
characterization in terms of ⊢ may seem pedantic; however, it is necessary to grasp 
every detail of it because we will apply exactly the same pattern of analysis to many 
other concepts that will not be quite so obvious. 
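One concrete reading of these rules, in the spirit of the algebra of parts, takes statements to be parts of a small universe, entailment to be inclusion, and "and" to be intersection. The sketch below assumes a three-element universe and checks the two-way rule for every triple of parts; `parts`, `entails`, and `and_rule_holds` are hypothetical names.

```python
from itertools import combinations

UNIVERSE = frozenset({0, 1, 2})


def parts(universe):
    """All parts (subsets) of the universe, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def entails(c, d):
    """C entails D, modeled as inclusion of parts."""
    return c <= d


def and_rule_holds():
    """C entails D1 and D2 together iff C entails D1 and C entails D2."""
    ps = parts(UNIVERSE)
    return all(
        entails(c, d1 & d2) == (entails(c, d1) and entails(c, d2))
        for c in ps for d1 in ps for d2 in ps
    )
```

In this model the part corresponding to "true" is the whole universe, so that every part entails it.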

The statement “true” is roughly like the quantity 1. It is characterized by the fact 
that 

        C ⊢ true 

holds for any statement C. 

Exercise A.1 

The statements 

        C ⊢ C ∧ true        C ∧ true ⊢ C 

hold for any C. ◊ 

We often write C ≡ C ∧ true. Here A ≡ B (read “A is equivalent to B”) means, 
as in this case, that entailment ⊢ holds in both directions. 

To say that a statement D is true means that it follows from true: 

        true ⊢ D 


Exercise A.2 

Whenever 

        true ⊢ D₁ 
        true ⊢ D₂ 

then 

        true ⊢ D₁ ∧ D₂ 
◊ 

The relation of “false” to ⊢ is essentially opposed to that of “true”. That is, 

        false ⊢ D 

holds for any statement D, and 

        C ⊢ false 

means that C is false; for example 0 = 1 ⊢ false holds in any nondegenerate ring. 
The basic rule of inference for ∨ is opposite to that for ∧: 

        C₁ ∨ C₂ ⊢ D 
        ──────────────── 
        C₁ ⊢ D, C₂ ⊢ D 

Note that the comma below the line still means “and” in the metalanguage even 
though what we are describing is “or”. 

Exercise A.3 

Verify that the above rule of inference is intuitively reasonable; derive from it that 

        C_i ⊢ C₁ ∨ C₂        i = 1, 2 

        D ∨ D ⊢ D 

holds for any C₁, C₂, D and also that whenever C₁ ⊢ D₁, C₂ ⊢ D₂, 

        C₁ ∨ C₂ ⊢ D₁ ∨ D₂ 

then use transitivity and the identity inference to rederive the two-way rule from 
these special consequences, i.e. analyze the ∨ rule in the “same pattern” by which 
we analyzed the ∧ rule. ◊ 

Exercise A.4 

(false ∨ C) ≡ C, for all C ◊ 

The operation ⇒ applied to a pair of statements B, D gives another statement 
B ⇒ D, which is usually read “B implies D” or “if B then D”. (It is to be 
distinguished from B ⊢ D, which is a statement about statements usually written down 
only when we mean to assert that D in fact follows from B, whereas B ⇒ D is a 
statement which, like other statements, may be important to consider even when it 
is only partly true.) The rule of inference characterizing ⇒ is not only in terms of 
⊢, but also involves ∧: 

        C ⊢ (B ⇒ D) 
        ───────────── 
        C ∧ B ⊢ D 

(This is sometimes called “modus ponens and the deduction theorem” by logicians.) 
That is, if we know that D follows from C ∧ B, then we can conclude that B ⇒ D 
follows from C alone (and also that C ⇒ D follows from B alone since C ∧ B ≡ 
B ∧ C); conversely, if we know that B ⇒ D follows from C, then we can conclude 
that D alone follows from the composite hypothesis C ∧ B. In particular, since 
(true ∧ B) ≡ B, 

        true ⊢ B ⇒ D 
        ───────────── 
        B ⊢ D 

holds for any two statements B, D; this motivates the frequent abuse of notation 
whereby we often write ⇒ when we mean ⊢. This is an abuse that causes no major 
problems when no more than one implication is involved, but (for example) when 
we assert 

        (B ⇒ D) ⊢ E 

we are not asserting that E is necessarily true but only that it follows whenever 
B ⇒ D is true, which in turn means .... But in particular the preceding discussion 
indicates the basic strategy for proving the truth of statements of the form B ⇒ D, 
namely, we temporarily assume that B is true and then try to show that D follows; 
if we succeed, then we have proven the truth of B ⇒ D with no assumptions, for 
that is what B ⇒ D means. 
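The two-way rule for implication can also be tested in the model where statements are parts of a finite universe. This sketch assumes, as one possible interpretation, that B ⇒ D is the part consisting of everything outside B together with everything in D; the function names are hypothetical.

```python
from itertools import combinations

UNIVERSE = frozenset({0, 1, 2})


def parts(universe):
    """All parts (subsets) of the universe, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def entails(c, d):
    """C entails D, modeled as inclusion of parts."""
    return c <= d


def implies(b, d):
    """Assumed interpretation of B => D: the complement of B, joined with D."""
    return (UNIVERSE - b) | d


def implication_rule_holds():
    """C entails (B => D) iff (C and B) entails D."""
    ps = parts(UNIVERSE)
    return all(
        entails(c, implies(b, d)) == entails(c & b, d)
        for c in ps for b in ps for d in ps
    )


def truth_of_implication_matches_entailment():
    """true entails (B => D) iff B entails D."""
    ps = parts(UNIVERSE)
    return all(
        entails(UNIVERSE, implies(b, d)) == entails(b, d)
        for b in ps for d in ps
    )
```

The second check is the special case C = true, which is the justification given in the text for writing ⇒ where ⊢ is meant.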

The ⇒ operator has many properties: 

Exercise A.5 

        true ⊢ (B ⇒ B)                                for all B 
        B ∧ (B ⇒ D) ⊢ D                               for all B, D 
        (A ⇒ B) ∧ (B ⇒ C) ⊢ (A ⇒ C)                   for all A, B, C 
        if B₁ ⊢ B₂ then (A ⇒ B₁) ⊢ (A ⇒ B₂)           for all A 
        if A₂ ⊢ A₁ then (A₁ ⇒ B) ⊢ (A₂ ⇒ B)           for all B 
        ((C ∧ B) ⇒ D) ≡ (C ⇒ (B ⇒ D))                 for all C, B, D 
        B ⇒ (D₁ ∧ D₂) ≡ (B ⇒ D₁) ∧ (B ⇒ D₂)           for all B, D₁, D₂ 
        (B₁ ∨ B₂) ⇒ D ≡ (B₁ ⇒ D) ∧ (B₂ ⇒ D)           for all B₁, B₂, D 
        (true ⇒ D) ≡ D 
        (false ⇒ D) ≡ true 
        (D ⇒ true) ≡ true 
◊ 

The expressions above are all to be proved using the basic rules of inference 
for ⇒. The presence of the operator ⇒ satisfying its rule of inference enables us to 
prove an important “distributivity” between “and” and “or” that does not explicitly 
mention ⇒ but which cannot be proved without it: 

        A ∧ (B₁ ∨ B₂) ≡ (A ∧ B₁) ∨ (A ∧ B₂) 

Recall that the equivalence “≡” is a metastatement (statement about statements) 
whose assertion means that in fact ⊢ holds in both directions; using ⇒ we can 
define an analog among statements themselves: 

        B ⇔ D 

is an abbreviation that stands for the compound statement 

        (B ⇒ D) ∧ (D ⇒ B) 

Often B ⇔ D is read “B iff D,” and the strategy for proving its truth has two parts: 
first temporarily assume B and deduce D; then forget that, temporarily assume 
D, and try to prove B (this second part is called “proving the converse” since, 
independently of their truth, D ⇒ B is called the converse of B ⇒ D); only if we 
succeed in proving both of these do we have a right to assert the ⇔ statement, 
i.e. only then have we proved B ⇔ D to be true. 

The reader may have noted the absence, in the exercise listing various properties 
of ⇒, of any simple equivalence for statements of the form (B ⇒ false). That is 
because the abbreviation 

        ¬B ≡ (B ⇒ false) 

defines another important logical operation read “not B” roughly speaking because 
the statement denying B is essentially the statement that assuming B would lead 
to a contradiction. From the basic rules of inference we can prove the following 
properties of ¬: 

Exercise A.6 

        (B ∧ ¬B) ≡ false 
        if (B ∧ C) ≡ false then C ⊢ ¬B 
        ¬(B₁ ∨ B₂) ≡ (¬B₁) ∧ (¬B₂) 
        (B ⇒ ¬D) ≡ ¬(B ∧ D) 
        B ⊢ ¬¬B 
        false ≡ ¬true 
        true ≡ ¬false 
◊ 

Exercise A.7 

If we suppose (B ∨ ¬B) ≡ true for all B (this “law of the excluded middle” is not 
valid in all systems of statements of interest in mathematical analysis), then we 
further have 

        ¬¬B ⊢ B 
        ¬(B₁ ∧ B₂) ≡ (¬B₁) ∨ (¬B₂)        (de Morgan’s law) 

Of course, if B is an equation x = y, then we usually write 

        ¬B ≡ x ≠ y 
◊ 
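A very small system of statements in which the law of the excluded middle fails can be written down explicitly. The sketch below assumes a three-element chain of truth values 0 < 1 < 2, with entailment read as ≤, “and” as minimum, “or” as maximum, and implication defined so that it satisfies the two-way rule; this particular model is an illustrative choice and not one taken from the text.

```python
CHAIN = [0, 1, 2]        # FALSE = 0 < 1 < 2 = TRUE
FALSE, TRUE = 0, 2


def entails(c, d):
    """C entails D, modeled as the order of the chain."""
    return c <= d


def implies(b, d):
    """Implication on the chain: TRUE when b <= d, otherwise d."""
    return TRUE if b <= d else d


def neg(b):
    """not B, defined as (B => false)."""
    return implies(b, FALSE)


def implication_rule_holds():
    """C entails (B => D) iff min(C, B) entails D."""
    return all(
        entails(c, implies(b, d)) == entails(min(c, b), d)
        for c in CHAIN for b in CHAIN for d in CHAIN
    )


def excluded_middle_failures():
    """Values B for which (B or not B) is not TRUE."""
    return [b for b in CHAIN if max(b, neg(b)) != TRUE]


def double_negation_failures():
    """Values B for which (not not B) does not entail B."""
    return [b for b in CHAIN if not entails(neg(neg(b)), b)]
```

Only the middle value violates either law, while B ⊢ ¬¬B continues to hold for every value, in agreement with Exercise A.6.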


Remark A.8: It is often useful to introduce another logical operation B\A called 
“logical subtraction” and read “B but not A” characterized by a rule of inference 
dual to that for implication 

        B\A ⊢ C 
        ───────────── 
        B ⊢ A ∨ C 

Exercise A.9 

Logical subtraction satisfies a whole list of properties roughly opposite to those 
satisfied by ⇒. In case the law of excluded middle holds, we could define logical 
subtraction as (B\A) ≡ B ∧ (A ⇒ false). However, in general \ may exist when ⇒ 
is impossible and in other cases ⇒ may exist but \ is impossible. In terms of \ we 
can define a different negation operator A' = (true\A), but even when both 
operations ( )', ¬( ) are present A' may be different from ¬A. (Example: the A’s, B’s, 
etc., denote real numbers between 0, 1, and ⊢ denotes ≤ of real numbers.) ◊ 

Exercise A.10

If we define

\[ \partial A \equiv A \wedge A' \]

(read “boundary of A”) in a system where $\setminus$ is present, then from the rule of inference
follows

\[ \partial(A \wedge B) \equiv ((\partial A) \wedge B) \vee (A \wedge \partial B) \]

a boundary formula of topology that is important for the boundary-value problems
of analysis and physics (note the similarity with the Leibniz rule for differentiation
of quantities). ◊
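
The example mentioned in Exercise A.9 can be explored numerically. The following Python sketch is only an illustration under stated assumptions: the five sample values and all function names are hypothetical choices, statements are numbers between 0 and 1, entailment is $\leq$, “and” is the minimum, and “or” is the maximum. It checks the two-way rules of inference for $\Rightarrow$ and $\setminus$ and the boundary formula of Exercise A.10 on the sample only, which is evidence rather than a proof.

```python
from fractions import Fraction
from itertools import product

# Hypothetical finite sample of "statements": numbers between 0 and 1.
VALUES = [Fraction(k, 4) for k in range(5)]   # 0, 1/4, 1/2, 3/4, 1
TRUE, FALSE = Fraction(1), Fraction(0)

def implies(a, b):
    """a => b: the largest c with min(c, a) <= b."""
    return TRUE if a <= b else b

def minus(b, a):
    """b but not a: the smallest c with b <= max(a, c)."""
    return FALSE if b <= a else b

def neg(a):
    """not a, defined as (a => false)."""
    return implies(a, FALSE)

def prime(a):
    """a', defined as (true but not a)."""
    return minus(TRUE, a)

def boundary(a):
    """Boundary of a, defined as (a and a')."""
    return min(a, prime(a))

def rules_hold():
    """Check both rules of inference and the boundary formula on VALUES."""
    for a, b, c in product(VALUES, repeat=3):
        if (min(c, a) <= b) != (c <= implies(a, b)):
            return False
        if (minus(b, a) <= c) != (b <= max(a, c)):
            return False
    for a, b in product(VALUES, repeat=2):
        leibniz = max(min(boundary(a), b), min(a, boundary(b)))
        if boundary(min(a, b)) != leibniz:
            return False
    return True

half = Fraction(1, 2)
print(rules_hold())
print(neg(half), prime(half))   # the two negations of 1/2
```

In this model the two negations of 1/2 come out different, which is the phenomenon the exercise points to.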

To specify the role of the existential quantifier $\exists$ and the universal quantifier $\forall$ it
is necessary to be more specific about the kinds of statements we are considering.
Consider two universes X, Y of things (points, bodies, quantities, etc.) being talked
about and a mapping f between them:

\[ X \xrightarrow{\;f\;} Y \]

Assume that there are two systems of statements both having some or all of the
logical operations previously discussed; one system refers to the things in X, the
other to the things in Y. Thus,

\[ A \vdash_X C \]

means that both A, C refer to the things in X and that whenever A is true of
something in X, then C is also true of the same thing; this is frequently symbolized
by the use of a variable x to denote the things in X, and A(x) signifies that x satisfies
the statement A. Then

\[ A(x) \vdash_x C(x) \]

means the same as

\[ A \vdash_X C \]

i.e., any x in X that satisfies A also satisfies C. Similarly,

\[ B \vdash_Y D \]

means the same sort of relation between statements about the things in Y. Now the
role of the given mapping f is this: it associates to every thing x in X a specific thing
fx in Y. Hence, to any statement B about the things in Y there is a corresponding
statement Bf about things in X

\[ (Bf)(x) \equiv B(f(x)) \]

by definition. This process is often called “substitution” or “inverse image” along f.
For example, if $f(x) = x^2$ and $B(y) \equiv (y - 1 = 10)$, then $(Bf)(x) \equiv (x^2 - 1 = 10)$,
whereas if $D(y) \equiv (y + 1 = 0)$, then $(Df)(x) \equiv (x^2 + 1 = 0)$. It is easy to
see that

\[ \text{if } B \vdash_Y D \text{ then } Bf \vdash_X Df \]

for any statements B, D about things in Y. Thus, the process of substitution of f into
statements goes “backwards” relative to f, i.e., it takes statements on the codomain
Y of f back to statements on the domain X of f, and moreover it “preserves
entailment $\vdash$,” i.e., maps statements in fact related by $\vdash_Y$ to statements related
by $\vdash_X$.

Both operations $\forall_f$ and $\exists_f$ by contrast are “covariant,” meaning that they go in
the same direction as f. Thus, in many situations, for any admissible statement A
on X, the statement $\exists_f A$ will be an admissible statement on Y, where $\exists_f$ is related
to substitution along f by the following rule of inference

\[ \frac{\exists_f A \vdash_Y B}{A \vdash_X Bf} \]

for any admissible statement B on Y, or using the “variable” notation,

\[ (\exists_f A)(y) \vdash B(y) \text{ for any } y \text{ in } Y \]

iff

\[ A(x) \vdash B(f(x)) \text{ for any } x \text{ in } X \]

This rule of inference is precisely the rule for carrying out calculations on the basis
of the meaning of $\exists_f A$, which is

\[ (\exists_f A)(y) \equiv \text{“there exists something } x \text{ in } X \text{ that satisfies } A(x) \text{ and goes to } y \text{ under } f\colon f(x) = y\text{”} \]

and is clearly a statement about y’s rather than x’s. For (going from below the line
to above) if A, B are related by $A(x) \vdash_x B(f(x))$ for all x, then to infer B(y) for
a given y, it suffices to know that we can find some x for which $y = f(x)$ and for
which A(x) holds; (going down) if conversely the existential statement entails B
on Y for all y, then the entailment below the line must hold for all x.

The expression $\exists_f A$ names the image of A along f, denoted f(A).
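
For finite universes the rule of inference for $\exists_f$ can be checked exhaustively. The sketch below is an illustration, not part of the formal development: the universes `X`, `Y`, the mapping `f`, and all function names are hypothetical, and a statement is modeled as the set of things that satisfy it, so that entailment becomes set inclusion.

```python
from itertools import chain, combinations

# Hypothetical finite universes and mapping, chosen only for illustration.
X = [0, 1, 2, 3]
Y = ["a", "b", "c"]
f = {0: "a", 1: "a", 2: "b", 3: "b"}   # nothing is sent to "c"

def subsets(universe):
    """All statements on a finite universe, each given as a frozenset."""
    items = list(universe)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))]

def substitute(B):
    """Bf: the x in X for which B(f(x)) holds (inverse image along f)."""
    return frozenset(x for x in X if f[x] in B)

def exists_f(A):
    """The y in Y hit by some x satisfying A (image of A along f)."""
    return frozenset(f[x] for x in A)

def entails(P, Q):
    """P entails Q: everything satisfying P also satisfies Q."""
    return P <= Q

def rule_for_exists_holds():
    """exists_f(A) entails B exactly when A entails Bf, for all A and B."""
    return all(entails(exists_f(A), B) == entails(A, substitute(B))
               for A in subsets(X) for B in subsets(Y))

print(rule_for_exists_holds())
```

Modeling statements as subsets is the simplest choice available in the category of abstract sets; the check says nothing about systems of statements in which the rule might fail to determine an operation.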

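Exercise A.11

\[ \text{If } A_1 \vdash_X A_2 \text{ then } \exists_f A_1 \vdash_Y \exists_f A_2 \]

\[ \exists_f\, \mathrm{false}_X \equiv_Y \mathrm{false}_Y \]

\[ \exists_f(A_1 \vee A_2) \equiv_Y (\exists_f A_1) \vee (\exists_f A_2) \]

\[ A \vdash_X (\exists_f A)f \]

\[ \exists_f(Bf) \vdash_Y B \]

◊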

Similarly, the universal quantification $\forall_f$ along f is characterized by its relation to
substitution along f, which is the rule of inference

\[ \frac{B \vdash_Y \forall_f A}{Bf \vdash_X A} \]

for all statements A on X and B on Y. This rule governs the calculations appropriate
to the meaning

\[ (\forall_f A)(y) \equiv \text{“for all things } x \text{ in } X \text{ for which } fx = y,\ A(x) \text{ holds”} \]

Exercise A.12

Formulate and prove the basic properties of $\forall_f$ that are dual to those stated in
Exercise A.11 for $\exists_f$. ◊
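
As a finite sanity check on the rule of inference for $\forall_f$, and on two of the dual properties asked for in Exercise A.12, one can again model statements as subsets. The universes, the mapping, and the function names below are hypothetical; the check is exhaustive only for this one small example and does not replace the proof requested in the exercise.

```python
from itertools import chain, combinations

# Hypothetical finite universes and mapping, chosen only for illustration.
X = [0, 1, 2, 3]
Y = ["a", "b", "c"]
f = {0: "a", 1: "a", 2: "b", 3: "b"}   # nothing is sent to "c"

def subsets(universe):
    """All statements on a finite universe, each given as a frozenset."""
    items = list(universe)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))]

def substitute(B):
    """Bf: the x in X for which B(f(x)) holds."""
    return frozenset(x for x in X if f[x] in B)

def forall_f(A):
    """The y in Y such that every x with f(x) = y satisfies A."""
    return frozenset(y for y in Y if all(x in A for x in X if f[x] == y))

def rule_for_forall_holds():
    """B entails forall_f(A) exactly when Bf entails A, for all A and B."""
    return all((B <= forall_f(A)) == (substitute(B) <= A)
               for A in subsets(X) for B in subsets(Y))

def preserves_true_and_conjunction():
    """Duals of the 'false' and 'or' properties of the existential case."""
    if forall_f(frozenset(X)) != frozenset(Y):
        return False
    return all(forall_f(A1 & A2) == forall_f(A1) & forall_f(A2)
               for A1 in subsets(X) for A2 in subsets(X))

print(rule_for_forall_holds(), preserves_true_and_conjunction())
```

Note that `forall_f` of the false statement on `X` still contains `"c"`, because no x goes to `"c"` and the universal condition holds vacuously there.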

We now give three basic examples of mappings f along which quantification is
often performed. First, suppose Y = 1 is a universe with only one thing but X is
arbitrary. Then only one mapping f

\[ X \xrightarrow{\;f\;} 1 \]

is possible. In the category of abstract, constant sets (which one usually refers to
as “the” category of sets), 1 has only two subsets, so that up to equivalence there
are only two statements on 1, “true” and “false”. Their inverse images under the
unique f are $\mathrm{true}_X$ and $\mathrm{false}_X$, but on X there are in general many more statements
A(x) possible (except if X is the empty universe, in which case $\mathrm{true}_X \equiv \mathrm{false}_X$). A
general A really depends on x in the sense that $A(x_0)$ may hold for one thing $x_0$ in
X, and $A(x_1)$ may not hold for another thing $x_1$ in X. But $\exists_f A$ no longer depends
on anything; it is either absolutely true (in case A(x) holds for at least one x) or
absolutely false (in case A(x) fails for all x); in other words

\[ \exists_f A \equiv_1 \mathrm{true}_1 \quad \text{if } A \not\equiv \mathrm{false}_X \]
\[ \exists_f A \equiv_1 \mathrm{false}_1 \quad \text{if } A \equiv \mathrm{false}_X \]

For a more general and typical example, suppose Y is an arbitrary set (or “universe
of things”) but that we have another set T and define X to be the cartesian product
T × Y. Then, a simple choice of f is the projection map

\[ T \times Y \xrightarrow{\;f\;} Y \]

\[ f(x) = y \text{ if } x = (t, y) \]
\[ \text{i.e., } f(t, y) = y \text{ for all } (t, y) \text{ in } X \]

where we recall that everything x in X is uniquely expressible in the form x = (t, y),
where t is in T and y is in Y. Now if we consider any admissible statement B on
Y, we have

\[ (Bf)(t, y) \equiv B(y) \]

i.e., Bf is the “same” statement as B but is now considered to depend (vacuously)
on the additional variable t. On the other hand, a typical statement A on X really
depends on both t and y (and for that reason is often called a “relation” between T and
Y), but its existential quantification along the projection f no longer depends on t:

\[ \frac{(\exists_f A)(y) \vdash_Y B(y)}{A(t, y) \vdash_{T \times Y} B(y)} \]

Usually, a notation for f is chosen emphasizing what it forgets (namely t) rather
than the rest of x (namely y) that it retains; this is because, when given y, the
existence of an x that maps to it by the projection is equivalent to the existence of
a t such that the pair (t, y) has the required property, since x for which f(x) = y
is uniquely of that form x = (t, y) for a t uniquely determined by x (of course t is
not uniquely determined by y, except for special A). Thus, traditionally one writes

\[ \exists t A \equiv \exists_f A \]

for f the projection on a cartesian product that leaves out t, and A a statement on
that product, or (slightly confusingly) with all variables displayed

\[ \exists t A(t, y) \equiv_Y (\exists_f A)(y) \]

where f(t, y) = y. The left-hand side depends on y but not on t; one says that the
operator $\exists t$ “binds” the variable t just as a definite integral $\int_a^b f(t)\,dt$ is merely a
number, not a function of t. Similarly

\[ \forall t A(t, y) \equiv_Y (\forall_f A)(y) \]

where f is the projection f(t, y) = y and A is a statement on the entire cartesian
product X = T × Y, since to say that all x for which fx = y have property A
(y being given) is in this special case the same as to say that all t, when paired with
y, satisfy the relation A(t, y), owing to the nature of f.

Our first example is a special case of this second one, since if Y = 1 then
X = T × Y = T. A further important subcase of this second example is as
follows: suppose $Y = T^2 = T \times T$; then $X = T \times Y = T \times T^2 = T^3$, and the same
projection f is now

\[ f(t, t_1, t_2) = (t_1, t_2) \]

The statements B on Y are binary relations on T, whereas the statements A on X
are ternary relations on T. Quantifying along this f (either $\forall$ or $\exists$) a ternary relation
gives a binary relation. As an example of how this is used in expressing properties
of particular binary relations B on T (i.e., statements on $T^2$), let us consider a
particular one we will denote by $\leq$:

\[ t_1 \leq t_2 \equiv_{T \times T} B(t_1, t_2) \]

The notation suggests that we might want to consider whether B is “transitive”, a
condition usually written as

\[ t_1 \leq t_2 \wedge t_2 \leq t_3 \Rightarrow t_1 \leq t_3 \]

As mentioned before, we do not really need the complication of the $\Rightarrow$ operator
to assert something as simple as this; the $\vdash$ relation on statements is good enough,
but which $\vdash$? Clearly, it must be the $\vdash_3$ between statements on $X = T^3$ since
the left-hand side of the implication involves three variables, but then what is the
meaning of the two sides when all we were given concretely to speak about was the
binary relation B, that is, a certain statement on $Y = T^2$? There are three different
tautological f’s that enter essentially into the construction even if we do not always
explicitly mention them,

\[ X = T^3 \;\substack{\longrightarrow \\ \longrightarrow \\ \longrightarrow}\; T^2 = Y \]

namely, we define

\[ \pi_3(t_1, t_2, t_3) = (t_1, t_2) \]
\[ \pi_1(t_1, t_2, t_3) = (t_2, t_3) \]
\[ \pi_2(t_1, t_2, t_3) = (t_1, t_3) \]

following the custom to name the projection for what it leaves out. Then with B
denoting the given $\leq$ relation on $T^2$,

\[ (B\pi_3)(t_1, t_2, t_3) \equiv t_1 \leq t_2 \]

\[ (B\pi_1)(t_1, t_2, t_3) \equiv t_2 \leq t_3 \]

are both statements on $X = T^3$ so that the $\wedge$ operator for such statements can be
applied by forming

\[ (B\pi_3 \wedge B\pi_1)(t_1, t_2, t_3) \equiv (t_1 \leq t_2) \wedge (t_2 \leq t_3) \]

and the result compared with the other $B\pi_2$. The expression

\[ B\pi_3 \wedge B\pi_1 \vdash_3 B\pi_2 \]


is then the entailment relation signifying that B on $T^2$ is transitive. But now noting
that $B\pi_2$ is actually independent of the middle variable $t_2$, we can (by the rule of
inference for $\exists$) existentially quantify to get an entailment on $T^2$,

\[ \exists_{\pi_2}(B\pi_3 \wedge B\pi_1) \vdash_2 B \]

which equally well expresses the same property of B; or in variable notation,

\[ \exists r[t \leq r \wedge r \leq s] \vdash_2 t \leq s \]

That is, to prove $t \leq s$ by use of transitivity is independent of any specific r between
them; it is sufficient to know that “there exists” one.

Both of the preceding forms of the transitivity condition on B are used in everyday
reasoning; it is essential to know that they are equivalent. However, one could say
crudely (and a similar comment can be made about each of the logical operators):
although the determining rule of inference says that the operator is “not needed”
when it occurs on one side of the $\vdash$, once the operator is determined it can also occur
on the other side of the $\vdash$; there it usually is essential and cannot be eliminated. For
example, we could express $C \vdash D_1 \wedge D_2$ without $\wedge$ just by listing $C \vdash D_1$, $C \vdash D_2$,
but $A \wedge B \vdash C$ expresses (except in trivial cases) that A, B must be jointly assumed
to deduce C; of course in that situation we can “eliminate” the $\wedge$ in favor of
the $\Rightarrow$, $A \vdash (B \Rightarrow C)$, but we can in general go further: i.e., having introduced $\Rightarrow$,
we have to admit the possibility of conditions like $(B \Rightarrow C) \vdash F$, but there is no way
in general to eliminate this occurrence of $\Rightarrow$. Similarly $C_1 \vee C_2 \vdash D$ is equivalent
to listing $C_i \vdash D$, but a condition like $C \vdash D_1 \vee D_2$ cannot be expressed in general
in terms of simpler operators. For example, the condition that a transitive binary
relation $\leq$ be a linear order

\[ \mathrm{true}_{T^2} \vdash_2 [t_1 \leq t_2 \vee t_2 \leq t_1] \]

which is just

\[ \mathrm{true} \vdash B \vee B\sigma, \qquad \sigma(t_1, t_2) = (t_2, t_1) \]

in the notation used above, cannot be expressed in terms of equations or simple
logical operations such as $\wedge$ but requires the use of $\vee$. Now a similar remark
involving $\exists$ concerns the statement that a transitive relation be “dense”. If we
consider a transitive binary relation

\[ t < s \equiv D(t, s) \]

(using different symbols to distinguish from B) it is said to be dense if (in variables)

\[ t < s \vdash \exists r[t < r \wedge r < s] \]

i.e., (roughly speaking) if more things can always be found between given things
(relative to D). Without variables, this says

\[ D \vdash_2 \exists_{\pi_2}[D\pi_3 \wedge D\pi_1] \]

For example, this property holds in the universe of real numbers but not in the
universe of whole numbers with the usual interpretation of D. So long as we
consider only the “structure” T, D by itself, and hence only statements that can be
built up logically from D, there is no way to eliminate the $\exists$ (since it occurs on
the right-hand side of $\vdash$). The example of real numbers shows, however, that if we
have at our disposal additional structure, in particular if we can make statements of
the type

\[ F(r, t, s) \equiv \left[ r = \frac{t + s}{2} \right] \]

we can (using equational axioms, etc., about F) prove outright that if t < s, then

\[ t < \frac{t + s}{2} < s \]

from which the $\exists r \ldots$ could be deduced since we already have something more
explicit.
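
The equivalence of the two forms of transitivity, and the role of the midpoint as an explicit witness for density, can be illustrated on small examples. In the following sketch the four-element universe `T`, the random sampling of relations, and every function name are hypothetical; a binary relation is modeled as a set of pairs. Agreement on sampled relations is evidence for the equivalence, not a proof of it.

```python
from fractions import Fraction
from itertools import product
import random

T = range(4)   # hypothetical small universe

def transitive_three_variable(B):
    """(B pi_3 and B pi_1) entails B pi_2, as statements on T^3."""
    return all((t1, t3) in B
               for t1, t2, t3 in product(T, repeat=3)
               if (t1, t2) in B and (t2, t3) in B)

def compose_with_itself(B):
    """Exists r [t B r and r B s], a statement on T^2."""
    return {(t, s) for t, s in product(T, repeat=2)
            if any((t, r) in B and (r, s) in B for r in T)}

def transitive_existential(B):
    """The quantified statement on T^2 entails B."""
    return compose_with_itself(B) <= B

def forms_agree(trials=200, seed=0):
    """Compare the two forms on randomly sampled relations on T."""
    rng = random.Random(seed)
    pairs = list(product(T, repeat=2))
    for _ in range(trials):
        B = {p for p in pairs if rng.random() < 0.5}
        if transitive_three_variable(B) != transitive_existential(B):
            return False
    return True

def midpoint(t, s):
    """The explicit witness r = (t + s)/2 for density."""
    return (t + s) / 2

def whole_number_between(t, s):
    """Some whole number r with t < r < s, or None if there is none."""
    for r in range(t + 1, s):
        return r
    return None

print(forms_agree())
t, s = Fraction(1, 3), Fraction(1, 2)
print(t < midpoint(t, s) < s)
print(whole_number_between(0, 1))
```

Rational numbers stand in here for the real numbers of the text, since exact midpoints are then available; among whole numbers no witness exists between 0 and 1.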

As a third example, we will consider the case in which $X \xrightarrow{\;f\;} Y$ is the “diagonal”
mapping

\[ X \xrightarrow{\;\Delta\;} X \times X \]

defined by

\[ \Delta(x) = (x, x) \]

Then, for any binary relation B on X (statement on $X^2$)

\[ (B\Delta)(x) \equiv B(x, x) \]

defines a unary relation (statement on X). Special interest attaches to

\[ E \equiv \exists_\Delta(\mathrm{true}_X) \]

the binary relation on X obtained by existentially quantifying the trivially true unary
relation along the diagonal map. By the rule of inference for $\exists$,

\[ \frac{E \vdash_2 B}{\mathrm{true}_X \vdash_1 B\Delta} \]

or in terms of variables,

\[ E(x_1, x_2) \vdash B(x_1, x_2) \]

holds if and only if B is a binary relation whose diagonalization is outright true,
that is,

\[ \mathrm{true}_X \vdash B(x, x) \]

To obtain a class of B with that property, consider any (unary) statement A on X
and consider

\[ B(x_1, x_2) \equiv (A(x_1) \Rightarrow A(x_2)) \]

i.e., $B \equiv (Ap_1 \Rightarrow Ap_2)$, where $p_i(x_1, x_2) = x_i$, $i = 1, 2$;
then

\[ (B\Delta)(x) \equiv (A(x) \Rightarrow A(x)) \]

which is identically true since any statement truly implies itself. Hence, by the
preceding fragment of the rule of inference for $\exists$,

\[ E \vdash B \]

proving

\[ E(x_1, x_2) \vdash (A(x_1) \Rightarrow A(x_2)) \text{ for any } A \]

which is the usual rule of inference for equality E,

\[ x_1 = x_2 \equiv E(x_1, x_2) \]

the rule being often called the rule of substitution for equality. Using equality E,
we can make explicit the usual distinction between our B and D by

\[ E \vdash_2 B \]

\[ E \wedge D \vdash_2 \mathrm{false}_{T^2} \]

which express that B is reflexive whereas D is antireflexive (stronger than merely
being nonreflexive). Although antireflexivity is thus expressed using only $\wedge$ and
false, we could introduce the somewhat more complicated operator $\Rightarrow$ if we wished
by quoting its rule of inference, yielding

\[ D \vdash (E \Rightarrow \mathrm{false}) \]

By the definition of “not,” we see that (when we can use the operator $\Rightarrow$)
antireflexivity of D is equivalent to

\[ D \vdash \neg E \]

or with variables

\[ t_1 < t_2 \vdash t_1 \neq t_2 \]

or if one prefers (since we used $\Rightarrow$ anyway)

\[ \mathrm{true} \vdash t_1 < t_2 \Rightarrow t_1 \neq t_2 \]

Finally, since one usually treats the “true $\vdash$” as understood when asserting
something,

\[ t_1 < t_2 \Rightarrow t_1 \neq t_2 \]

An extremely important use of the equality relation E is in the definition of the
unique existential quantifier that establishes a link between the general relations
expressed by “statements,” as discussed above, on the one hand and the well-defined
“operations” of algebra and analysis on the other hand. First, we consider uniqueness
(without commitment about existence). If A is any statement on T, we want to
express the idea that at most one thing in T satisfies A. This would mean that whenever
$t_1$, $t_2$ both have the property expressed by A, then in fact $t_1$, $t_2$ denote the same thing
(the traditional example is A(t) ≡ “t is the king of France at a given time”); in other
words,

\[ A(t_1) \wedge A(t_2) \vdash t_1 = t_2 \]

where the entailment $\vdash$ is obviously supposed to be on T × T. It is important to
allow additional variables, so we generalize this to

\[ A(t_1, y) \wedge A(t_2, y) \vdash_{T^2 \times Y} t_1 = t_2 \]

where the right-hand side must mean $E\pi$, when the map $\pi\colon T^2 \times Y \to T^2$ is the
obvious projection, and the left-hand side is $A\pi_1 \wedge A\pi_2$ where the maps

\[ T^2 \times Y \;\underset{\pi_2}{\overset{\pi_1}{\rightrightarrows}}\; T \times Y \]

are again obvious once we recognize that there are two of them. Finally, it is very
important to be able to consider this uniqueness property of A independently of
whether we know it is true, so we use $\Rightarrow$ instead of $\vdash$ to define a new statement,
“uniqueness in T of A,”

\[ \mathrm{Un}_T A \equiv \forall_{\pi_Y}[A\pi_2 \wedge A\pi_1 \Rightarrow E\pi] \]

which is a statement on Y whenever A is a statement on T × Y; here the universal
quantification is taken along the projection $\pi_Y\colon T^2 \times Y \to Y$ that leaves out both
copies of T. With variables,

\[ (\mathrm{Un}_T A)(y) \equiv \forall t_1, t_2[A(t_1, y) \wedge A(t_2, y) \Rightarrow t_1 = t_2] \]

For example A(t, y) might have the meaning “t is a solution of a certain differential
equation with initial value y”. Then, to prove $\mathrm{Un}_T A$ for y would mean that any
two solutions of the equation having initial value y are equal (as functions); this is
a separate issue from that of whether there are any solutions starting from y, and
both are independent (unless we know more about the differential equation) from
the issue of whether existence or uniqueness holds for some other y.


Exercise A.13

Let $T = \mathbb{R} = Y$, i.e., both universes are the set of all real numbers. Consider the
relation

\[ A(t, y) \equiv [y = t(t - 1)] \]

Then

\[ \vdash (\mathrm{Un}_T A)(y) \Leftrightarrow y \leq -\tfrac{1}{4} \]

For example,

\[ (\mathrm{Un}_T A)(0) \vdash \mathrm{false} \]

(Draw graph.) ◊

We have already discussed

\[ (\exists_T A)(y) \equiv \exists t A(t, y) \]

and so we can define

\[ \exists!_T A \equiv \exists_T A \wedge \mathrm{Un}_T A \]

a statement on Y, for any statement A on T × Y, written in variables

\[ \exists! t A(t, y) \equiv \exists t[A(t, y)] \wedge \forall t_1, t_2[A(t_1, y) \wedge A(t_2, y) \Rightarrow t_1 = t_2] \]

and read “there exists a unique t for which A(t, y)”.


Exercise A.14

With the notation of the previous exercise,

\[ \vdash (\exists_T A)(y) \Leftrightarrow y \geq -\tfrac{1}{4} \]

and

\[ \vdash (\exists!_T A)(y) \Leftrightarrow y = -\tfrac{1}{4} \]

(Refer to the graph of the function defined in the previous exercise.) ◊
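
The three statements of Exercises A.13 and A.14 can be tabulated for sample values of y. The sketch below is an illustration with hypothetical function names; it counts the real solutions t of y = t(t − 1) from the sign of the discriminant 1 + 4y of the quadratic $t^2 - t - y$, and uses exact rational sample values of y in place of arbitrary real numbers.

```python
from fractions import Fraction

def solution_count(y):
    """Number of real t with y = t(t - 1), from the discriminant of t^2 - t - y."""
    discriminant = 1 + 4 * y
    if discriminant > 0:
        return 2
    return 1 if discriminant == 0 else 0

def unique(y):
    """(Un_T A)(y): at most one t satisfies A(t, y)."""
    return solution_count(y) <= 1

def exists(y):
    """(exists_T A)(y): at least one t satisfies A(t, y)."""
    return solution_count(y) >= 1

def exists_unique(y):
    """(exists!_T A)(y): the conjunction of the two statements above."""
    return exists(y) and unique(y)

# Hypothetical sample of y values: multiples of 1/8 between -2 and 2.
samples = [Fraction(k, 8) for k in range(-16, 17)]
quarter = Fraction(-1, 4)

for y in (Fraction(-1), quarter, Fraction(0)):
    print(y, unique(y), exists(y), exists_unique(y))
```

At y = 0 there are the two solutions t = 0 and t = 1, so uniqueness fails there, in agreement with the example stated in Exercise A.13.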

Now if $Y \xrightarrow{\;g\;} T$ is any given mapping, we can define a statement G on T × Y
(often called the “graph” of g) by

\[ G(t, y) \equiv [t = g(y)] \]

Then it will be true (as a statement on 1) that

\[ \forall y\, \exists! t\, G(t, y) \]

Conversely, (by the very meaning of “arbitrary” mapping) if we have any statement
G on T × Y for which the above $\forall\exists!$ statement on 1 is true, there will be a mapping
$Y \xrightarrow{\;g\;} T$ whose graph is G. What will be its value at any given thing y in Y? It
will be “the” t (justified by the ! clause) for which G(t, y) (which exists by the $\exists$
clause in the $\exists!$).

Exercise A.15

If G is a statement on T × Y and H is a statement on X × T, then their composition
is defined to be the statement on X × Y expressed by

\[ \exists t[H(x, t) \wedge G(t, y)] \]

This is actually $\exists_\pi[H\pi \wedge G\pi]$, where the three $\pi$’s are different mappings
with domain X × T × Y. A binary relation on T is transitive if and only if it follows
from its composition with itself. If G, H are graphs of mappings Y → T, T → X,
then their composition is the graph of a mapping Y → X. ◊

Exercise A.16

The condition that G be the graph of a mapping can be expressed without
using $\forall$, $\Rightarrow$ by using instead $\exists$, $\wedge$. In fact, we can use $\exists$, $\wedge$ in the simple
combination known as composition; namely, consider the transpose relation $G^*$ of G
defined by

\[ G^*(y, t) \equiv G(t, y) \]

and consider $E_T$, $E_Y$ the equality relations on T and Y, respectively; then G is the
graph of a mapping if and only if the two entailments

\[ E_Y \vdash_{Y^2} G^* \circ G \]

\[ G \circ G^* \vdash_{T^2} E_T \]

hold, where the small circle denotes composition. (In fact if for a given G there is any
relation $G^*$ that satisfies these two entailments, then $G^*$ must be the transpose of
G, as defined above, and hence G is the graph of a mapping.) This reformulation
is important since $\exists$, $\wedge$ (which go into the definition of $\circ$) are stable under
many more geometrical transformations used in analysis than are $\forall$, $\Rightarrow$ (which
go into the definition of !). Moreover this latter kind of relation (expressed in the
two displayed entailments) between “relations” and “mappings” will persist in
situations in which the values of G(t, y) are much richer mathematical objects than
just “yes” and “no” answers. ◊
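
The criterion of Exercise A.16 and the composition of Exercise A.15 can be tried out on finite relations. In this sketch the universes `T`, `Y`, `X`, the sample relations, and the function names are all hypothetical; a relation is a set of pairs, and `compose(H, G)` is written directly from the defining formula $\exists t[H(x, t) \wedge G(t, y)]$.

```python
# Hypothetical finite universes, chosen only for illustration.
T = [0, 1, 2]
Y = ["p", "q"]
X = ["u", "v"]

def transpose(G):
    """G*(y, t) holds exactly when G(t, y) holds."""
    return {(y, t) for (t, y) in G}

def compose(H, G):
    """Exists m [H(x, m) and G(m, y)], as a set of pairs (x, y)."""
    return {(x, y) for (x, m1) in H for (m2, y) in G if m1 == m2}

def equality(U):
    """The equality relation on a universe U."""
    return {(u, u) for u in U}

def is_graph(G):
    """E_Y entails G* o G, and G o G* entails E_T."""
    G_star = transpose(G)
    return (equality(Y) <= compose(G_star, G)
            and compose(G, G_star) <= equality(T))

# Graph of a hypothetical mapping g: Y -> T, as a statement on T x Y.
G = {(0, "p"), (2, "q")}
# Graph of a hypothetical mapping h: T -> X, as a statement on X x T.
H = {("u", 0), ("v", 1), ("v", 2)}

print(is_graph(G))
print(is_graph({(0, "p")}))                         # no t at all for "q"
print(is_graph({(0, "p"), (1, "p"), (2, "q")}))     # two values of t for "p"
print(sorted(compose(H, G)))                        # graph of h after g
```

The first entailment fails exactly when some y has no t at all, and the second fails exactly when some y has two different t’s, which are the existence and uniqueness clauses of $\exists!$.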


A.2 Fields, Nilpotents, Idempotents 

Examples of Logical Operators in Algebra 

The proofs of many of the exercises in this section will be clearer if explicit rules
of inference from the previous section are cited at the appropriate points.

The most basic properties of algebraic structures such as rings, linear spaces,
and categories are expressed by equations; for example, the distributive property,
nilpotence, associativity, or the property of being a solution. However, in working
with these equations we must frequently use stronger logical operators, both in
stating stronger axioms on the ground ring in linear algebra and in summarizing
the meaning of our complicated calculations. (It should be remarked, however, that
most of this logic again becomes equational when we pass to a higher realm.) For
example, using the logical symbol $\vdash$, which can be read “entails,” the additional
axioms stating that a given ring R is a field are that R is nondegenerate,

\[ 0 = 1 \vdash \mathrm{false} \]

(usually expressed by introducing “not” and saying

\[ \mathrm{true} \vdash 0 \neq 1) \]

and that every nonzero quantity in R has a reciprocal,

\[ x \neq 0 \vdash \exists y[xy = 1] \]

When the law of excluded middle is valid, the latter is equivalent to the (in general
stronger) condition, involving the logical symbol for “or,”

\[ x = 0 \vee \exists y[xy = 1] \]

being $\mathrm{true}_R$ (which has the virtue of being invariant under more geometrical
transformations but the drawback, in those cases like continuous functions where the law
of excluded middle is false, of being less likely to be true). Usually, one expresses
this field axiom using $\forall$, $\Rightarrow$ as

\[ \mathrm{true} \vdash \forall x[x \neq 0 \Rightarrow \exists y[xy = 1]] \]

with the understanding that the universe over which both x, y vary is R. Thus
$\mathbb{Z} = \{\ldots, -3, -2, -1, 0, 1, 2, 3, \ldots\}$ is a ring R that is not a field since, for example,
$5 \neq 0$, but there is no y in $\mathbb{Z}$ for which 5y = 1. In any ring we can deduce purely
equationally from the hypotheses

\[ xy_1 = 1 \]
\[ xy_2 = 1 \]

that $y_1 = y_2$ (here is the deduction, using only [commutative] ring axioms and the
hypotheses:

\[ y_1 = y_1 \cdot 1 = y_1(xy_2) = (y_1 x)y_2 = (xy_1)y_2 = 1 \cdot y_2 = y_2) \]

Therefore, we can conclude that in any ring, reciprocals are unique, and hence in
any field that (using the exclamation point to signify this uniqueness)

\[ \forall x[x \neq 0 \Rightarrow \exists! y[xy = 1]] \]

Further (since R is nondegenerate if it is a field), “the” (just justified) reciprocal of
x cannot be zero either.

Exercise A.17

If y is a reciprocal of x, then x is a reciprocal of y; if, in any given ring R, 0 has a
reciprocal, then R is degenerate. In case R is a field, if we restrict the universe to
the set G of all nonzero elements of R (G is no longer a ring) the slightly simpler
statement

\[ \forall x\, \exists! y[xy = 1] \]

is true over G. Since this is the criterion for the existence of a mapping, there is a
mapping

\[ G \xrightarrow{\;(\ )^{-1}\;} G \]

called the reciprocal mapping whose graph is defined by

\[ x \cdot y = 1 \]

that is, $y = x^{-1}$ if and only if $x \cdot y = 1$. ◊

Exercise A.18

A much better way of understanding the last construction is as follows: Let R be
any commutative ring (not necessarily a field, maybe even degenerate). Define G
to be the subset of R consisting of all elements x of R satisfying

\[ \exists y[xy = 1] \]

in R. Then there is a reciprocal mapping $G \xrightarrow{\;(\ )^{-1}\;} G$, 1 is in G, and G is closed
under multiplication; i.e., if $x_1$, $x_2$ are in G, then $x_1 x_2$ is in G since $1^{-1} = 1$, and
$(x_1 x_2)^{-1} = x_2^{-1} x_1^{-1}$. This means that G is a (commutative) group called “the
multiplicative group of R”. If 0 is in G, then R is degenerate. For any x in G, −x is also
in G. But $x_1$, $x_2$ in G does not imply $x_1 + x_2$ in G. If $R = \mathbb{R}$, the real numbers,
then $1 + x^2$ is always in G for any x, and the same is true if R = C(S) = the ring
of all continuous real-valued functions on any continuous domain (“topological
space”) S. ◊
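
Several claims of Exercise A.18 can be observed in the finite rings of integers modulo n. The sketch below is an illustration with a hypothetical function name; it finds the multiplicative group by brute-force search for reciprocals, and the choice of the moduli 12 and 1 is only for the sake of example.

```python
def units(n):
    """Map each x in Z/n having a reciprocal to that reciprocal."""
    found = {}
    for x in range(n):
        for y in range(n):
            if (x * y) % n == 1 % n:
                found[x] = y
                break
    return found

G = units(12)
print(sorted(G))

# G contains 1, is closed under multiplication and under negation.
closed_under_product = all((x1 * x2) % 12 in G for x1 in G for x2 in G)
closed_under_negation = all((-x) % 12 in G for x in G)
print(closed_under_product, closed_under_negation)

# G is not closed under addition: 1 and 11 are units but their sum is 0.
print((1 + 11) % 12 in G)

# In the degenerate ring Z/1, where 0 = 1, the element 0 has a reciprocal.
print(0 in units(1))
```

The comparison `1 % n` rather than `1` is there so that the degenerate case n = 1 is handled by the same code.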

Now the condition that a ring R be a field is just that R be the disjoint union of
{0} and G, namely that (reading the $\forall$ form of the definition backwards and using
the symbol for “not”)

\[ \exists! x[\neg G(x)] \]

Since any ring R has a special element 1 and since R has an addition operation,
there are elements in R that may as well be denoted

\[ 2 = 1 + 1 \]
\[ 3 = 1 + 1 + 1 \]
\[ \vdots \]

(not all of these need be distinct). Even if R is a field, not all of these need have
reciprocals; for example, there is an important field with only three elements in
all in which 3 = 0. However, a great many rings, even those that are not fields, do
involve $\mathbb{R}$ in such a way that all of the above do have reciprocals, which can be
denoted as usual by $\tfrac{1}{2}, \tfrac{1}{3}, \ldots$ Thus, $\tfrac{1}{2}$ is in G, $\tfrac{1}{3}$ is in G, ..., where G denotes the
multiplicative group of any such ring R.

Exercise A.19

In any ring having $\tfrac{1}{2}$,

\[ \mathrm{true} \vdash \forall x\, \exists y[y + y = x] \]

◊

By a subring of a given ring R is meant a subset S of the elements of R that
contains 0, 1 and is closed under the addition, the subtraction, and the multiplication
of R. Thus, if p is any polynomial with coefficients in $\mathbb{Z}$ in several, say three,
variables, and if x, y, z are in S, then p(x, y, z) is also in S.

Exercise A.20

If R is a ring having $\tfrac{1}{2}$ and if S is a subset containing 0, 1 closed under addition and
closed under the unary operations of multiplication by $\tfrac{1}{2}$ and by −1, then S is a
subring if and only if S is closed under the unary operation of squaring. (The answer
is a frequently used formula.) ◊

Now a subring is not necessarily closed under division, even to the extent to
which division is defined in R. Thus, $\mathbb{Z} \subset \mathbb{R}$ is a subring, but $\mathbb{Z}$ is not a field even
though $\mathbb{R}$ is a field. But any subring of any field does have a special property not
shared by all rings of interest, namely

\[ xy = 0 \vdash [x = 0 \vee y = 0] \]


Exercise A.21

Prove the statement just made in any subring of a field. ◊

A nondegenerate ring S having this property for all x, y in S is called an integral
domain. This is intimately related to the cancellation property

\[ \forall x_1, x_2[ax_1 = ax_2 \Rightarrow x_1 = x_2] \]

for an element a, which (using subtraction) is easily proved equivalent to the
“nonzerodivisor” property of a

\[ \forall x[ax = 0 \Rightarrow x = 0] \]

where all universal quantifiers range over all $x$, $x_1$, $x_2$ in the ring in which we are
considering a. We might call a “monomorphic” in that ring. Note that this property
uses the logical operators in an essential way since, when we want to prove

\[ a \text{ is monomorphic} \vdash \text{something else about } a \]

we cannot always eliminate the $\forall$, $\Rightarrow$ implicit on the left-hand side. Of course,
if the “something else” is just another instance of the cancellation property, such
proof may present no problem. Now clearly a = 0 cannot be monomorphic in a
nondegenerate ring since

\[ ax = 0 \cdot x = 0 \]

for any x, yet we could take x = 1; hence, it would not follow that x = 0. Now the
idea of an integral domain is that (if the law of excluded middle is assumed) the
only a that is not monomorphic in R is a = 0. That is, the validity for all a in R of
any one of

\[ a \neq 0 \Rightarrow \forall x[ax = 0 \Rightarrow x = 0] \]

\[ \exists x[ax = 0 \wedge x \neq 0] \Rightarrow a = 0 \]

\[ \forall x[ax = 0 \Rightarrow [a = 0 \vee x = 0]] \]

is equivalent (using the law of the excluded middle) to the condition that R is
an integral domain. The last form with $\vee$ is the one familiar from high school as
a crucial step in the method of solving polynomial equations by factoring. This
method is used in proving

Theorem A.22: In any integral domain, the equation

\[ x^2 = x \]

has precisely two solutions.

Proof: If $x^2 = x$ then $x^2 - x = 0$, and hence $x(x - 1) = 0$ (since $x(x - 1) = x^2 - x$
in any ring). Now use the integral domain property to get $x = 0 \vee x - 1 = 0$, i.e.,
$x = 0 \vee x = 1$. We say “precisely two” because the ring is nondegenerate. ■

Exercise A.23

In any commutative ring, an element satisfying $x^2 = x$ is called idempotent. If x
is an idempotent, then so is its “complement” 1 − x. The product of any two
idempotents is an idempotent. If x and y are idempotents and if xy = 0 (one says x, y
are “disjoint” or “orthogonal”) then x + y is also an idempotent. One says a ring
“has connected spectrum” if it has precisely two idempotents. In general, the
idempotents describe chunks of the “spectrum,” for example, of a linear transformation
(which gives rise to a ring). ◊
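
Theorem A.22 and the claims of Exercise A.23 can be observed in the rings of integers modulo n. This sketch uses a hypothetical function name and hypothetical sample moduli; it lists idempotents by direct search, so it illustrates the statements in a few finite rings rather than proving them.

```python
def idempotents(n):
    """All x in Z/n with x * x = x."""
    return [x for x in range(n) if (x * x) % n == x]

# Z/7 is a field, hence an integral domain: precisely two idempotents.
print(idempotents(7))

# Z/6 is not an integral domain (2 * 3 = 0) and has more idempotents.
E = idempotents(6)
print(E)

# Complements, products, and sums of orthogonal idempotents in Z/6.
complements_ok = all((1 - x) % 6 in E for x in E)
products_ok = all((x * y) % 6 in E for x in E for y in E)
orthogonal_sums_ok = all((x + y) % 6 in E
                         for x in E for y in E if (x * y) % 6 == 0)
print(complements_ok, products_ok, orthogonal_sums_ok)

# Z/8 has connected spectrum although it is not an integral domain.
print(idempotents(8))
```

The last line shows that having precisely two idempotents is weaker than being an integral domain: in Z/8 the product 2 · 4 is zero, yet only 0 and 1 are idempotent.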

Very important in engineering calculus [B99], in analyzing linear
transformations, and so on, are the nilpotent elements in commutative rings, where x is
nilpotent if and only if

\[ \exists n[x^{n+1} = 0] \]

Here the $\exists n$ does not range over the ring we are talking about but rather over the set
0, 1, 2, 3, ... of natural numbers, which act as exponents on elements of any ring
(or indeed of any system wherein at least multiplication is defined). In more detail,
we could say that x is nilpotent of order 1 if

\[ x^2 = 0 \]

while $x \neq 0$, that x is nilpotent of order 2 if

\[ x^3 = 0 \]

while $x^2 \neq 0$, and so on. Of course 0 is nilpotent of order zero. In a nondegenerate
ring x = 1 is not nilpotent of any order. Using commutativity, we find that the
product of a nilpotent with any element is again nilpotent. Again using
commutativity, we discover that the sum of any two nilpotent elements is nilpotent; however,
the order may increase. For example, if $x^2 = 0$ and $y^2 = 0$, we can calculate that
$(x + y)^3 = 0$; as for $(x + y)^2 = 2xy$, it might be 0, but only in case 2xy = 0, which is not
always true. Analysis of the calculation leads to the idea that, to be sure of the
nilpotency of a sum, we have to add the orders of nilpotency of the summands:

Theorem A.24: If $x^{n+1} = 0$, $y^{m+1} = 0$ in a commutative ring, then always

\[ (x + y)^{n+m+1} = 0 \]

Proof: In any commutative ring the distributive law implies the binomial expansion

\[ (x + y)^p = \sum_{i+j=p} C_{i,j}\, x^i y^j \]

for any x, y in the ring and any p in $\mathbb{N}$ (note that $\mathbb{N} \subset \mathbb{Z}$ and that $\mathbb{Z}$ can be used as
coefficients in any ring - indeed in any system in which addition and subtraction
are defined. In fact

\[ C_{i,j} = \frac{(i + j)!}{i!\, j!} \]

is in $\mathbb{N}$ despite the denominators, where ! denotes “factorial,” by Pascal). Thus, the
proof of the theorem reduces to the following fact about the elementary arithmetic
of $\mathbb{N}$: ■


Lemma A.25: $i + j = n + m + 1 \vdash [i \geq n + 1] \vee [j \geq m + 1]$.

Exercise A.26

Prove Lemma A.25. ◊


In any case, since

\[ x^{n+1} = x^n \cdot x \]

is a product, it is immediately clear that

Theorem A.27: In an integral domain, the only nilpotent element is 0.


An extremely important property (for analysis, linear algebra, computer science,
etc.) is the following, showing that, although the existence of nilpotent elements has
the “negative” consequence that some elements (the ones “near” zero) are definitely
not invertible, it also has the “positive” consequence that some other elements
(those “near” 1) definitely are invertible, and there is even a specific formula for the
reciprocals.


Theorem A.28: If h is any nilpotent element in a ring, then 1 − h has a reciprocal
in the same ring. In fact if $h^{n+1} = 0$, then

\[ \frac{1}{1 - h} = \sum_{k=0}^{n} h^k \]


Proof: Calculate that the right-hand side, multiplied by 1 − h, gives 1. ■
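
The formula of Theorem A.28 can be tried out in the rings of integers modulo n, where nilpotent elements are easy to find. In the following sketch the function names and the sample moduli are hypothetical choices; the code computes the finite geometric sum and the caller can multiply it by 1 − h to see the result of the calculation indicated in the proof.

```python
def nilpotency_exponent(h, n):
    """Smallest m with h**(m+1) == 0 in Z/n, or None if h is not nilpotent."""
    power = h % n
    for m in range(n):
        if power == 0:
            return m
        power = (power * h) % n
    return None

def reciprocal_of_one_minus(h, n):
    """The sum of h**k for k = 0..m in Z/n, where h**(m+1) == 0."""
    m = nilpotency_exponent(h, n)
    if m is None:
        raise ValueError("h is not nilpotent in Z/n")
    return sum(pow(h, k, n) for k in range(m + 1)) % n

# In Z/16 the element h = 2 satisfies h**4 = 0.
r = reciprocal_of_one_minus(2, 16)
print(r, ((1 - 2) * r) % 16)

# In Z/27 the element h = 6 satisfies h**3 = 0.
r = reciprocal_of_one_minus(6, 27)
print(r, ((1 - 6) * r) % 27)
```

Only the nilpotent case of the theorem is modeled here; the convergent generalization mentioned in the text would need a ring with a notion of limit.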

Remark A.29: In a ring furnished with a notion of convergence, Theorem A.28 
can often be generalized to some h for which h" merely converges to 0 as n tends 
to oo, i.e. to small h’ s not necessarily so small as to be “nilpotent”. But the formula 
of the theorem is surprisingly often useful even just for the nilpotent case. 
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Exercise A.30 

If u has a reciprocal and h is nilpotent then u ± h has a reciprocal. (A formula, only 
slightly more complicated than that of the theorem, can either be deduced from the 
theorem as a corollary or calculated and proved directly.) ◊

Exercise A.31 

If $u_1 = 1 - h_1$, $u_2 = 1 - h_2$ are invertible elements of a (commutative) ring of
the form indicated with $h_1$, $h_2$ nilpotent (with orders of nilpotency $n_1$, $n_2$ say),
the product $u_1 u_2$ is of course invertible; is it again of the special form, namely
“infinitesimally near 1” in the sense that

$$u_1 u_2 = 1 - h$$

for some nilpotent $h$ of some order? Start with the special case $h_1^2 = 0 = h_2^2$,
$h_1 h_2 = 0$. What if $h_i = t_i \varepsilon$, where $\varepsilon^2 = 0$? What if multiplication were
noncommutative? ◊

Remark A.32: (An Embedding) Any given integral domain R can be realized as a 
subring of a field F by constructing F to consist of equivalence classes of fractions 
$\frac{x}{s}$, where $x$ is in $R$, $s$ is in $R$, and $s \neq 0$.

The condition that a ring $R$ has “no” (i.e., no nonzero) nilpotent elements is often
referred to in geometry and analysis by saying that $R$ is “reduced”. It is more general
than the cancellation (i.e., integral domain) property since, for example, $R = \mathbb{R}^2$
with coordinatewise multiplication is reduced (i.e., has no nonzero nilpotent element)
but is not an integral domain since it has nontrivial idempotent elements (0, 1), (1, 0). In
logical notation with variables, $R$ is reduced if and only if

$$\exists n\,[x^{n+1} = 0] \vdash x = 0$$

holds for all $x$ in $R$. Since the $\exists$ occurs on the left, this is one of its eliminable cases.
But more profoundly (i.e., using something of the quantitative content of the theory 
of rings and not merely logical form): 

Exercise A.33 

If a ring satisfies

$$\vdash \forall x\,[x^2 = 0 \Rightarrow x = 0]$$

then it is reduced.

Hint: Show that if $x^{n+1} = 0$, then $x^{2n} = 0$; hence, using our main assumption, then
also $x^n = 0$. By induction, the $n$ can be decreased until eventually $n = 1$. ◊
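The statement of Exercise A.33 can be observed experimentally before it is proved. The sketch below assumes the ring is the integers modulo `modulus` and compares the two conditions (a nonzero element of square zero exists; a nonzero nilpotent element exists) for a range of moduli. It is only an illustration in one family of rings, not a proof, and the function names are hypothetical.

```python
def has_nonzero_square_zero(modulus):
    """True if some nonzero x in Z/modulus satisfies x**2 == 0."""
    return any((x * x) % modulus == 0 for x in range(1, modulus))


def has_nonzero_nilpotent(modulus):
    """True if some nonzero x in Z/modulus is nilpotent.

    In Z/modulus a nilpotent x already satisfies x**modulus == 0,
    so testing that single power suffices.
    """
    return any(pow(x, modulus, modulus) == 0 for x in range(1, modulus))
```

In this family the two conditions agree for every modulus tried; for example both hold modulo 12 and both fail modulo 30.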



Appendix B 

The Axiom of Choice and Maximal Principles 


The axiom of choice was first formulated by Zermelo in 1904 and used to prove 
his Well-Ordering theorem. The axiom was considered controversial because it 
introduced a highly nonconstructive aspect that differed from other axioms of 
set theory. For some time it was mainly used in the form of the Well-Ordering 
theorem (which is actually equivalent to the choice axiom). In this formulation, 
the axiom permits arguments by the so-called transfinite induction. For about the
last 50 years it has usually been used in the form of the Maximal Principle
of Zorn (published in the 1930s, though a version by Hausdorff was published
earlier).

If nontrivial variation with respect to some category S of abstract sets and ar- 
bitrary mappings is allowed in a category of variable sets, the axiom of choice 
tends not to hold as we have seen in Exercises 4.54, 6.12, and Section 10.3. In 
fact, the axiom is valid in certain very special toposes of variable sets determined 
relative to a category of abstract sets and arbitrary mappings, as was mentioned in 
Section 4.6. 

Here we will demonstrate that Zorn’s Maximal Principle is equivalent to the 
axiom of choice. We will show this for forms of the Maximal Principle that use 
both chains and directed (or filtered) posets. The latter form is more suitable for 
arguments made in mathematical practice, whereas it is the former that we will 
see as a direct consequence of the axiom of choice. In addition we will consider 
Hausdorff’s Maximal Principle and some other consequences of the choice axiom. 
Our proofs of the maximal principles will follow directly from the famous Fixed- 
Point theorem of Bourbaki (B.15 below). 

To state the results we need a few definitions. First recall the definition of poset,
Definition 10.6, that is, a category with at most one arrow between any two objects.
Any set of objects of a poset E determines a full subcategory, which is also a poset,
called a subposet of E.

Definition B.1: Let S be a set of objects of a poset E. An object u in E is an upper
bound for S if for any s in S, $s \leq u$.

Definition B.2: Let S be a set of objects of a poset E. The object u is a least upper
bound (abbreviated lub) or supremum (abbreviated sup) for S if it is an upper
bound for S, and moreover for any upper bound u' for S we have $u \leq u'$.

Dually we have definitions of lower bound and greatest lower bound (abbreviated
glb) or infimum (abbreviated inf) for a set S of objects of E.

Definition B.3: Let D be a set of objects of a poset E. We say D is directed
if it is nonempty and every two-element part of D has an upper bound in D, or
equivalently

$$\forall x, y \in D\ \exists u \in D\quad x \leq u \ \&\ y \leq u$$


Exercise B.4 

Show that the power set $\mathcal{P}X$ of any set $X$ is a poset with arrows given by the
inclusion relation $\subseteq$. Note that $\mathcal{P}X$ is directed as a part of itself. Indeed, any part
of $\mathcal{P}X$ has a sup. ◊

The next concept is important in the sequel. 

Definition B.5: An object m of a poset E is maximal if 

$$\forall x\quad m \leq x \Rightarrow m = x$$

A minimal object is defined dually.

An important special case is 

Definition B.6: An object m of a poset E is a maximum if m is an upper bound 
for all of E. A minimum object is defined dually. 

Exercise B.7 

Show that a maximum is maximal. Show that a maximum object of a subposet S
of E is a sup of S. ◊
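Definitions B.1 through B.6 can be made concrete on a finite poset. The following sketch assumes, purely for illustration, that the objects are positive integers ordered by divisibility; all function names are hypothetical.

```python
def leq(x, y):
    """Order relation of the example poset: x <= y means x divides y."""
    return y % x == 0


def upper_bounds(part, objects):
    """All u in objects with s <= u for every s in part (Definition B.1)."""
    return {u for u in objects if all(leq(s, u) for s in part)}


def sups(part, objects):
    """Upper bounds lying below every upper bound (Definition B.2)."""
    ubs = upper_bounds(part, objects)
    return {u for u in ubs if all(leq(u, v) for v in ubs)}


def is_directed(part):
    """Nonempty, and any two members have an upper bound in part (B.3)."""
    return bool(part) and all(
        any(leq(x, u) and leq(y, u) for u in part) for x in part for y in part
    )


def maximal_objects(objects):
    """Objects m such that m <= x forces m == x (Definition B.5)."""
    return {m for m in objects if all(x == m for x in objects if leq(m, x))}


def maximum_objects(objects):
    """Objects that are upper bounds for all of objects (Definition B.6)."""
    return {m for m in objects if all(leq(x, m) for x in objects)}
```

On the objects {1, 2, 3, 4, 6, 9} there are three maximal objects (4, 6, and 9) but no maximum, which shows that the two notions differ.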

We can now state (the directed version of)


The Maximal Principle of Zorn

If every directed part of a poset E has an upper bound, then E has a maximal object.

We can immediately prove that Zorn’s Maximal Principle implies the axiom of 
choice. This proof is a very typical example of the use of this principle in modern 
mathematics. 

Theorem B.8: The Maximal Principle of Zorn implies the axiom of choice. 

Proof: We need to begin with an epimorphism $e : A \twoheadrightarrow B$ in S and find a section
for it. This section should be a maximal object in some suitable poset, for this is
what Zorn’s principle provides. The objects of our poset $E$ will be pairs $(i, s)$, where
$i : B_i \hookrightarrow B$ is a part of $B$ and $s : B_i \to A$ satisfies $\forall b \in i,\ es(b) = b$. Note that
it is immediate that such $s$ must be mono. We call the pairs $(i, s)$ partial sections
of $e$. Partial sections of $e$ form a poset $E$ when $\leq$ is defined by

$$(i, s) \leq (i', s') \text{ iff } i \subseteq i' \ \&\ \text{the restriction of } s' \text{ to } B_i \text{ is } s$$

We will obtain a maximal object of $E$ by showing that $E$ has least upper bounds
for directed parts and applying the Maximal Principle. It will turn out that such a
maximal object provides a section. So let $D$ be a directed part of $E$. Define a part
$i_D : B_D \hookrightarrow B$ of $B$ by

$$\forall b\quad b \in i_D \text{ iff } \exists (i, s) \text{ in } D \ \&\ b \in i$$

Define $s_D : B_D \to A$ by

$$s_D(b) = s(b) \text{ whenever } b \in i \text{ and } (i, s) \text{ in } D$$

It is necessary to verify that $s_D$ is well-defined by showing that if $b \in i$ and $b \in i'$,
then $s(b) = s'(b)$. This is where we use that $D$ is directed: there is an upper bound
$(i'', s'')$ in $D$ for $(i, s)$, $(i', s')$. Thus, $i \subseteq i''$ and $i' \subseteq i''$, and $s(b) = s''(b) = s'(b)$.
It is also obvious that $(i_D, s_D)$ is an upper bound for $D$ since

$$\forall (i, s) \text{ in } D\quad b \in i \Rightarrow b \in i_D$$

and so $i \subseteq i_D$.

Thus, $E$ has a maximal object $(i_m, s_m)$. We claim that $s_m$ is the required section
of $e$, that is, $B_m = B$. If not, there is $b_0 : 1 \to B$ such that $b_0$ is not in $i_m$, and so

$$i_{m+1} = \binom{i_m}{b_0} : B_m + 1 \hookrightarrow B$$

is a part of $B$. Since $e$ is epi there is $a : 1 \to A$ such that $e(a) = b_0$. Let

$$s_{m+1} = \binom{s_m}{a} : B_m + 1 \to A$$

and then $(i_{m+1}, s_{m+1})$ is a partial section of $e$. However, $i_m \subseteq i_{m+1}$, but $i_{m+1}$ is not
included in $i_m$ (since $b_0$ is not in $i_m$), and this contradicts the maximality of $i_m$.
Thus, $s_m$ has domain $B$. ■
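The last step of this proof, extending a partial section by one more point, can be imitated for finite sets, where no maximal principle is needed because the extension process simply terminates. The following is a sketch under that finiteness assumption; the representation of a partial section as a Python dictionary and the function name are illustrative choices, not the text's.

```python
def extend_to_section(e, domain, codomain):
    """Build a section of the surjection e: domain -> codomain step by step.

    A partial section (i, s) is stored as a dict whose keys form the part
    B_i of the codomain and whose values give s.
    """
    section = {}  # the partial section with empty domain
    while set(section) != set(codomain):
        # Pick a point b0 of B that is missing from the current part B_m.
        b0 = next(b for b in codomain if b not in section)
        # Since e is epi, some a in A has e(a) == b0.
        a = next((x for x in domain if e(x) == b0), None)
        if a is None:
            raise ValueError("e is not surjective")
        section[b0] = a  # a strictly larger partial section
    return section
```

Each pass through the loop is the passage from $(i_m, s_m)$ to $(i_{m+1}, s_{m+1})$; the loop stops exactly when no further extension is possible, that is, when the part is all of the codomain.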

There are many other uses of the Maximal Principle of Zorn. For example, it
implies that any proper ideal in a commutative ring is contained in a maximal ideal 
(see Exercise B.34) and that any vector space has a basis. 

Definition B.9: A poset E is called a total order (or chain) if for objects x, y in
E either $x \leq y$ or $y \leq x$. A subposet C of E that is itself a chain is called a
chain in E.

Exercise B.10 

Show that a totally ordered poset E is directed. ◊

With this definition we can state

The Maximal Principle of Zorn (chain version) 

If E is a poset such that every chain in E has an upper bound, then E has a maximal 
object. 

Exercise B.11

Show that the Maximal Principle of Zorn (chain version) implies the Maximal
Principle of Zorn. ◊

Notice that a proof very similar to that of Theorem B.8 will show directly that 
the axiom of choice is implied by the chain version of the Maximal Principle. 

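Definition B.12: For any set $X$, $\forall_X$ denotes the characteristic function $\Omega^X \to \Omega$
of the one-element part of $\Omega^X$ whose element is the composite $X \to 1 \to \Omega$
called $\mathrm{true}_X$ for short. Thus

[Diagram: the square formed by $\mathrm{true}_X : 1 \to \Omega^X$, $\forall_X : \Omega^X \to \Omega$, the identity of 1, and $\mathrm{true} : 1 \to \Omega$]

is a pullback.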

Given any $F : I \to \Omega^X$, corresponding to an $I$-parameterized family of parts of $X$,
consider its transpose $X \to \Omega^I$; then, the composite of this transpose with $\forall_I$ is the
characteristic function of the part of $X$ called the intersection of the family $F$.

We will use the higher dual distributive law of logic

$$\forall a\,(La \text{ or } Ra) \text{ entails } \exists a\,(La) \text{ or } \forall a\,(Ra)$$

We will discuss below in Exercise B.26 the question of when this law is valid 
(it is certainly valid for any category S of abstract sets or any topos in which the 
axiom of choice is true). 

For simplicity we will assume that posets are strict in the sense that $x = y$ follows
from $x \leq y$ and $y \leq x$; then, sups are unique if they exist.

Definition B.13: If $f$ is an endomap of a set $X$, then a fixed point for $f$ is an
element $x$ of $X$ such that $fx = x$.

Definition B.14: If $f$ is an endomap of the set of objects of a poset $E$, then an
$f$-chain is any chain $C$ in $E$ such that $x \in C$ implies $fx \in C$. We denote the part
of the power set of the objects of $E$ consisting of all $f$-chains by $\mathrm{ch}_f(E)$.

Theorem B.15: (Bourbaki Fixed Point theorem 1950). Let $E$ be a poset, and let
$f$ be an inflationary endomap of the set of objects of $E$, that is, for all $x$ in $E$,
$x \leq f(x)$. Then $f$ has at least one fixed point provided that any chain in $E$ has a
sup in $E$.

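Proof: Define a subsystem to be any part (of the objects of $E$) closed with respect
to sups of $f$-chains and to $f$ itself. Let $A$ be the intersection of all subsystems of
$E$. Thus

[Diagrams: two squares. In the first, the inclusion $A \hookrightarrow E$ is compatible with $f$ on $A$ and $f$ on $E$. In the second, the inclusion $\mathrm{ch}_f(A) \hookrightarrow \mathrm{ch}_f(E)$ is compatible with $\sup : \mathrm{ch}_f(A) \to A$ and $\sup : \mathrm{ch}_f(E) \to E$.]

are commutative diagrams and $A$ is the smallest part with those properties. Since
we have allowed the part $\emptyset$ in $\mathrm{ch}_f(E)$, it follows that $A$ is nonempty and indeed
has a smallest element 0. Because of the first diagram, any fixed point of $A$ is also
a fixed point of $E$. Thus, for the remainder of the proof we work in $A$.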

We need to establish that any subsystem (such as $A$) whose only subsystem is
itself has the following two

Elementary Properties of Irreducible Subsystems

(P) $t \leq a$ entails $t = a$ or $f(t) \leq a$
(B) it is true that $t \leq a$ or $f(a) \leq t$

These properties will suffice because they clearly imply the “$f$-trichotomy”
property for $A$,

$$t = a \text{ or } f(a) \leq t \text{ or } f(t) \leq a$$

which in turn easily implies that the whole of $A$ is an $f$-chain. (As often in inductive
proofs, to reach the latter goal we needed to prove something stronger.) But $A$ being
an $f$-chain entails that $\sup A \in A$ since we have assumed that $A$ is closed under
such sups. Then $\sup A$ is a fixed point of $f$ since $\sup A \leq f(\sup A)$, yet $x \leq \sup A$
for all $x \in A$ and $f(\sup A) \in A$.

In order to establish the two properties above, define a part of $A$:

$$P = \{a \in A \mid \forall t \in A\,[t \leq a \Rightarrow [t = a \text{ or } f(t) \leq a]]\}$$

Then, more precisely, what we will show is that $P = A$.

It will suffice to show that $P$ is a subsystem, for then $A \subseteq P$ by definition of $A$.
To show that $P$ is a subsystem, we first will need to show that for any given $a \in P$
each of the following is a subsystem:

$$B_a = \{b \in A \mid b \leq a \text{ or } f(a) \leq b\}$$


Exercise B.16 

Because $a \in P$, $B_a$ is closed under $f$. ◊

Exercise B.17

The subset $B_a$ is closed under any sups that exist in $A$.

Thus $B_a = A$, i.e. any $a \in P$ is a “bridge point” for $A$.

Hint: Use the higher dual distributive law. ◊

Both of the clauses needed to show that P is a subsystem will depend on the 
bridge-point property. 

Exercise B.18 

Assume $a \in P$. Use the fact that $a$ is a bridge point to conclude that $fa \in P$. ◊

Exercise B.19 

Applying a different case of the higher dual distributive law to the fact that each
element of $P$ is a bridge point, conclude that $P$ is closed under sups of $f$-chains.

Hint: Suppose $C$ is any $f$-chain in $P$ and $t$ is any element of $A$ for which $t \leq \sup C$;
then it must be shown that $t = \sup C$ or $f(t) \leq \sup C$. But all elements of $C$ are
bridge points; hence, $t$ satisfies

$$\forall a \in C\,[t \leq a \text{ or } f(a) \leq t]$$

The higher dual distributive law can be applied by taking $L_t(a)$ to be “$t \leq a$” and
$R_t(a)$ to be “$f(a) \leq t$,” leading to two possibilities:

(1) If $\forall a \in C\,[f(a) \leq t]$, then $\forall a \in C\,[a \leq t]$ (because $a \leq f(a)$), and hence
$\sup C \leq t$, i.e. $t = \sup C$.

(2) If $t \leq a$ for some $a \in C$, then since $a \in P$, one has $t = a$ or $f(t) \leq a$:

(a) if $t = a$, then $f(t) = f(a)$, yet $f(a) \in C$ since $C$ is an $f$-chain, and so
$f(t) \leq \sup C$;

(b) if $f(t) \leq a$, then since $a \leq \sup C$ we again have $f(t) \leq \sup C$.

Thus, in all these cases $t = \sup C$ or $f(t) \leq \sup C$; since this is true for all $t \leq \sup C$,
it follows that $\sup C \in P$. ◊

Since the foregoing holds for all $f$-chains $C$, we have completed the proof that
$P$ is a subsystem. Therefore, $P = A$, and so, as shown above under “Elementary
Properties of Irreducible Subsystems,” $f$ has a fixed point. Thus, the exercise completes
the proof of the Bourbaki Fixed-Point theorem. ■
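In a finite poset the hypothesis on sups of chains reduces to the existence of a smallest element, and a fixed point can then be found by simply applying $f$ repeatedly to that element. The sketch below illustrates this special case only; it assumes the poset consists of the parts of a finite set of integers ordered by inclusion, and the particular inflationary map (add the least new number that keeps the set an antichain for divisibility) is a hypothetical example, not taken from the text.

```python
def is_antichain(s):
    """True if no member of s divides a different member of s."""
    return all(x == y or (y % x != 0 and x % y != 0) for x in s for y in s)


def make_f(universe):
    """Return an inflationary endomap f on the parts of universe."""
    def f(s):
        for x in sorted(universe - s):
            if is_antichain(s | {x}):
                return s | {x}  # strictly larger part
        return s  # nothing can be added: s is a fixed point
    return f


def iterate_to_fixed_point(f, bottom):
    """Apply f repeatedly, starting from the smallest element."""
    x = bottom
    while True:
        y = f(x)
        if not x <= y:
            raise ValueError("f is not inflationary")
        if y == x:
            return x
        x = y
```

A fixed point of this particular map is a part to which nothing can be added, which is how maximal principles are obtained from the fixed-point theorem.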

Exercise B.20 

(Hausdorff 1914) The axiom of choice implies that any poset $X$ contains a maximal
chain (the Hausdorff Maximal Principle). Indeed, if not, let $\varphi$ be a section for the
first projection in

[Proper inclusions of pairs of chains in $X$] $\to$ chains($X$)

and let $f$ be $\varphi$ followed by the second projection. The sup of a chain of chains
always exists and is again a chain.

Note: This proof of the Hausdorff Maximal Principle evidently uses Boolean logic
(which is actually a consequence of the axiom of choice, see [D75] or [Joh77]). ◊


Exercise B.21 

(Zorn 1935) Any poset in which every chain has an upper bound has itself a maximal 
element. 

Hint: An upper bound of a maximal chain is a maximal element; indeed, if $C$ is a
chain, and if $m$ is any upper bound for $C$ and $x$ is any element with $m \leq x$, then
$C \cup \{x\}$ is a chain; thus, if $C$ is maximal, $x \in C$ and hence $x = m$. ◊

Combining this exercise with Theorem B.8 and Exercise B.11, we have

Theorem B.22: The axiom of choice is equivalent to the Maximal Principle of Zorn
(either version!).

As mentioned at the beginning of this appendix, Zermelo used the axiom of
choice to prove his Well-Ordering theorem.

Definition B.23: A poset E is called well-ordered if whenever S is a nonempty 
part of the objects of E then S has a minimum object. 

Exercise B.24 

Show that a well-ordered poset E is a total order. 

Hint: Consider the minimum object of a two-object set. ◊

Exercise B.25 

(Zermelo 1908) Every set X can be well-ordered. 

Hint: Consider the set of all well-orderings of parts of X and order it by “initial 
segment”; in this case a maximal well-ordering must have the whole X as set of 
objects. ◊

A given surjective map has a section provided that certain associated sets are
well-orderable; therefore, the Well-Ordering theorem is actually equivalent to the axiom
of choice. In fact, there are literally dozens of useful mathematical principles in
topology and algebra that have also been shown equivalent to the choice axiom
over the past 75 years.

The higher dual distributive law of logic used in the above proof of Theorem
B.15 is true in many toposes but not in most. This law is dual to the usual infinite
distributive law

$$u \leq u_a \text{ for all } a \text{ entails } u \wedge \bigvee_a w_a \leq \bigvee_a (u_a \wedge w_a)$$

which is true in any topos because of the existence of implication as an operation
on subobjects. In a Boolean topos like S, the power sets are self-dual, and hence
the higher dual law holds as well.

Exercise B.26 

Does the higher dual distributive law imply the Boolean property? Does it hold in 
the topos S of all actions of C on abstract sets, in case C is discrete, or a group, 
or a poset? ◊

Here are some additional exercises: 

Exercise B.27 

An irreducible $f$-system is not only closed with respect to sups of $f$-chains, but in
fact has sups of all chains.

Hint: Use a third instance of the higher dual distributive law. ◊

Exercise B.28

Any irreducible $f$-system, such as $P = A$ in Exercise B.19, has two further
properties: The endomap $f$ is order-preserving, and every part has a sup. ◊

Note: The latter two properties are of importance even when $f$ is not inflationary
and the system is not irreducible: Tarski’s fixed-point theorem states that every
order-preserving endomap of a poset having all sups has a fixed point. The reader
should be able to prove it; it is easier than the Bourbaki theorem whose consequences
we are discussing in this appendix.
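One standard proof of Tarski's theorem takes the sup of all $x$ with $x \leq f(x)$ and shows that it is fixed. The sketch below carries this out in the poset of all parts of a small finite set, where sups are unions; the example map `step` (built from a hypothetical table of edges) and the function names are assumptions made for illustration.

```python
from itertools import combinations


def all_parts(universe):
    """List every part of a finite set as a frozenset."""
    items = sorted(universe)
    return [
        frozenset(c)
        for r in range(len(items) + 1)
        for c in combinations(items, r)
    ]


def tarski_fixed_point(f, universe):
    """Return the sup (union) of all parts S with S included in f(S).

    For an order-preserving f this union is a fixed point of f.
    """
    post_fixed = [s for s in all_parts(universe) if s <= f(s)]
    return frozenset().union(*post_fixed)


# Hypothetical example: an order-preserving map on the parts of {1,...,5}.
EDGES = {1: {2}, 2: {3}, 3: {1}, 4: {5}, 5: set()}


def step(s):
    """Send S to {1} together with all successors of members of S."""
    return frozenset({1}).union(*(EDGES[x] for x in s))
```

The enumeration of all parts is exponential in the size of the set, so this is meant only as an illustration of the argument, not as a practical algorithm.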

Exercise B.29 

Use Tarski’s fixed-point theorem to show that if there are parts $\alpha : B \hookrightarrow A$ and
$\beta : A \hookrightarrow B$, then there exists an isomorphism $A \cong B$. This result depends on the
further assumption that all parts have Boolean complements; it is not true for most
categories of variable or cohesive sets.

Hint: Construct an order-preserving endomap of $2^A$ by using complementation in
both $A$ and $B$ and also applying both $\alpha$ and $\beta$; on a fixed “point” of that endomap,
use $\beta$, and on the complement use a retraction for $\alpha$. The construction actually can
be applied to any pair of maps; what conclusion can then be drawn? ◊

Exercise B.30 

Using Zorn’s Maximal Principle (and Boolean logic) show that any family $E \to I$
of sets makes $I$ a chain with respect to the ordering: $i \leq j$ iff there exists an injective
$E_i \to E_j$.

Hint: If there is no injective $E_1 \to E_2$ then any maximal partial injection from
part of $E_2$ into $E_1$

[Diagram: a part $X \hookrightarrow E_2$ together with an injective map $X \to E_1$]

will actually have its domain total, that is $X = E_2$. ◊

General Remark: Inductive arguments rely on special cases of the principle that
within any given set E as universe, if too many conflicting higher-order operations
are considered, they will be constrained to have some relation. The simplest sort of
such relation is a fixed point. For example, arbitrary sups and an order-preserving
endomap are sufficiently conflicting, according to the Tarski theorem, as are sups
of chains and an inflationary endomap, according to the Bourbaki theorem. Some
systems of operations, even higher-order ones, are sufficiently harmonious to permit
“freedom” from such unexpected relations, as the following example shows:

Exercise B.31 

Free sups exist. More precisely, given any set $X$, there is a poset $PX$ that has sups
of all parts and a map $\eta : X \to PX$ such that, given any map $\varphi : X \to E$, where $E$
is any poset having sups of all parts (no relations are assumed in $E$ beyond the
definition of sups), there is a unique map $\varphi' : PX \to E$ that preserves all sups and
satisfies $\varphi'\eta = \varphi$. ◊
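One candidate for $PX$ is the power set of $X$ with $\eta$ sending each element to its singleton. The following sketch, for finite $X$ only, shows how a map $\varphi$ would then extend; the target poset chosen here (positive integers under divisibility, with least common multiple as sup) and all names are assumptions made for illustration.

```python
from functools import reduce
from math import gcd


def eta(x):
    """The unit map X -> P X, sending x to the singleton part {x}."""
    return frozenset({x})


def extend(phi, sup):
    """Return the map P X -> E sending a part to the sup of its phi-values."""
    def phi_prime(part):
        return sup(phi(x) for x in part)
    return phi_prime


def lcm_sup(values):
    """Sup for divisibility: the least common multiple (1 for no values)."""
    return reduce(lambda a, b: a * b // gcd(a, b), values, 1)
```

Since every part is the union of its singletons, a sup-preserving map out of the power set is forced to take the sup of the values on singletons, which is the uniqueness half of the statement.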

Of course systems of finitary operations that are free of any unexpected relations
do tend to exist because of the assumption that a natural number system exists on
some set. There are many variants of that theme, for example:

Exercise B.32 

Consider systems $E$ involving a nullary operation $\infty : 1 \to E$ and a unary operation
$f : E \to E$ subject to the condition that $f(\infty) = \infty$; a category is obtained by
considering that a homomorphism between any two such systems is a map of the carrying
sets that preserves both these operations. Show that there is a free such system $N'$
generated by the one-point set, i.e. there is $0 : 1 \to N'$ such that for any such system
$E$ and for any given point $x : 1 \to E$ there is a unique homomorphism $x' : N' \to E$ for
which $x'(0) = x$. Does $N'$ have any $f$-fixed points other than $\infty$? (In the category
of abstract sets, $N'$ has an ordering for which all parts have sups, but in categories
of variable or cohesive sets this $N'$ fails to have all sups.) ◊

Thus, it is the combining of higher-order operations with very distinct (even 
finitary) ones that may be impossible to do in a “free” way within any fixed set. 

Example: (Gaifman [G64]) Any infinite poset with arbitrary sups and an endomap
$f$ of the underlying set for which $ff$ = identity must satisfy some additional
relation. (Actually, Gaifman showed that there are no “free” [in a sense analogous
to those of Exercise B.31 and B.32] complete Boolean algebras except finite ones,
which implies our statement.) ◊

The irreducible subsystems (such as $A$ in our proof of the Bourbaki theorem)
that help to reveal these relations seem to have the property that every element
is somehow “reached” from below by applying the operations. It has been found
useful (by Cantor and since) to make this reaching idea appear more precise by
introducing well-orderings along which the operations can be “infinitely iterated”.
In the case of our irreducible systems $A = P$ with sups of chains and an inflationary
$f$, this is already achieved:

Exercise B.33 

Consider a chain $C$ having sups of any part as well as an order-preserving endomap
$f$ such that $x \leq fx$ for all $x$. (Then there is a smallest element 0 and a largest
element $\infty$, and moreover $f(\infty) = \infty$.) If $C$ has no parts closed under $f$ and sups
(other than $C$ itself), then

$$f(x) = x \Rightarrow x = \infty$$

Can you show that $C$ is well-ordered? ◊

Returning to finitary algebra, the following is a typical application of Zorn’s
Maximal Principle:

Exercise B.34 

If $A$ is any commutative ring in which it is false that 0 = 1, then there exists a
field $F$ and a surjective ring homomorphism $A \to F$ (such a homomorphism is a
“closed point” of the space spec $A$, which plays a key role in algebraic geometry and
functional analysis. Since typically $A$ has a great many surjective homomorphisms
to other rings $A \to B$, where also $0 \neq 1$ in $B$ [such homomorphisms correspond to
nonempty closed subspaces of spec $A$], by applying the exercise to $B$ and composing,
we see that spec $A$ typically has many closed points).

Hint: Surjective homomorphisms with domain $A$ are determined by their kernels,
which are $A$-linear subgroups of $A$; the codomain of such a homomorphism is a
field iff the corresponding $A$-linear subgroup is maximal among those that do not
contain 1. The $A$-linear subgroups have been known since Dedekind as ideals. ◊

C.1 Definitions of Some Mathematical and
Logical Concepts

Included here are some of the main definitions from the text. Several entries go 
beyond bare definition, in an attempt to provide a window into the historical and 
foundational background. 

Adjoint Functors : 

Let $\mathcal{X}$ and $\mathcal{A}$ be categories and $F : \mathcal{X} \to \mathcal{A}$ and $G : \mathcal{A} \to \mathcal{X}$ be functors. We say
that $F$ is left adjoint to $G$ (or equivalently $G$ is right adjoint to $F$, and write this
$F \dashv G$) if for any objects $X$ in $\mathcal{X}$ and $A$ in $\mathcal{A}$ there is a given bijection $\varphi(X, A)$
between arrows from $FX$ to $A$ and arrows from $X$ to $GA$,

$$\frac{FX \to A}{X \to GA}$$

which is moreover natural in $X$ and $A$. This means that the following holds. Suppose
that $x : X' \to X$ in $\mathcal{X}$ and $f : FX \to A$. Then

$$\varphi(X, A)(f) \circ x = \varphi(X', A)(f \circ Fx)$$

and the similar condition holds for arrows $a : A \to A'$:

$$\varphi(X, A')(a \circ f) = Ga \circ \varphi(X, A)(f)$$

There are many examples of this concept in the text and exercises. For example,
consider the diagonal functor $\Delta : S \to S \times S$, which sends a set $X$ to
the pair $(X, X)$ (and a mapping $f$ to $(f, f)$). The sum functor $+ : S \times S \to S$,
which sends $(A, B)$ to $A + B$, is left adjoint to $\Delta$, whereas the product functor
is right adjoint to $\Delta$. Another key example is the exponential or mapping-space
functor $(-)^B$ whose properties follow from its definition as the right adjoint of
$(- \times B)$.

In case $F_1$ and $F_2$ are both left adjoint to $G$, there is a natural isomorphism
$F_1 \cong F_2$, and, similarly, any two right adjoints to $F$ are naturally isomorphic.
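For the adjunction between $(- \times B)$ and $(-)^B$, the bijection $\varphi(X, A)$ is the familiar passage from a function of two arguments to a function returning a function. The sketch below models this with Python callables; the names `curry` and `uncurry` are conventional in programming but are not used in the text.

```python
def curry(f):
    """Send f : X x B -> A to its transpose X -> A^B."""
    return lambda x: (lambda b: f(x, b))


def uncurry(g):
    """Send g : X -> A^B back to a map X x B -> A (the inverse of curry)."""
    return lambda x, b: g(x)(b)
```

Naturality in $X$ then says that currying after precomposing with a map into $X$ in the first argument gives the same result as precomposing the curried map, which can be checked pointwise on finite sets.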


Algebraic Topology: 

The cohesive spaces of various categories (continuous, smooth, analytic, com- 
binatorial) serve as domains for cohesively variable quantities and as arenas for 
uninterrupted motion. But in particular, a space also has qualitative attributes that 
collectively might be called its “shape,” the best-known such attribute being mea- 
sured by the Euler characteristic and the Betti numbers that count the number 
of k-dimensional “holes” in the space. (The qualitative attributes of a cohesive 
space can make a crucial difference when dealing with the differential and inte- 
gral calculus of quantities that vary over such a space as domain; such quantities 
arise in electromagnetism and statistical mechanics, and the precise effect of the 
shape on their behavior is expressed by theorems of de Rham and of Stokes.) 
Not only Betti and Volterra, but also Vietoris and Noether, as well as Hurewicz 
and Steenrod, and many others, devised algebraic and combinatorial construc- 
tions for getting at these qualitative attributes. Those constructions were varied 
and sometimes complex so questions naturally arose: Do they always give the 
“same answer,” and can they be adequately conceived in a more direct manner? In 
1952 Eilenberg and Steenrod showed how to answer those questions by unifying 
the subject in a way that also made possible the ensuing 50 years of remarkable 
advance. Their use of the axiomatic method resulted from participating in the on- 
going development, extracting the essential features, and making these features 
explicit in order that they could serve as a basis and guide for further work in the 
field. 

The analogy between set theory and algebraic topology goes even further.
Namely, in both subjects the examples first considered satisfy a special axiom
concerning the one-point space 1: In the abstract-set categories there are enough
maps from 1 to separate maps between any two sets, and in singular cohomology,
the cohomology of 1 vanishes in higher dimensions. But later important uses of both
axiom systems have involved mathematically rich situations in which those special
axioms do not hold. When it was first realized that these special axioms were too
restrictive, one spoke of “generalized elements” and “generalized cohomology
theories,” respectively, but as the central role of these situations became established, the
word “generalized” was gradually dropped, and now we speak just of “elements”
(i.e., figures of various basic shapes, not just punctiform) and of “cohomology
theories” (which do not necessarily vanish in higher dimensions even for the one-point
space). In the case of set theory we approach these more general situations through
our investigations of variable sets.

Category Theory:

Category Theory was made explicit by Eilenberg and Mac Lane. In 1945 [EM45] 
they concentrated some of the essential general features of the developments up 
to then in algebraic topology and in functional analysis, which are still-developing 
branches of the science that finds and uses the relation between quality and quantity 
in the study of space and number. A further development of category theory came in 
1958 [Kan58] when Kan made explicit the notion of adjoint functors, which were 
then rapidly seen to be ubiquitous (even if implicit) in mathematics. In the 1960s 
and 1970s it was established in detail that the mathematically useful portions of 
set theory and of logic in the narrow sense can be seen as part of category theory 
[La69b], [KR77], [FS79].

Characteristic Functions (2.20): 

Suppose we have fixed an element $v_1 : 1 \to V$ of a set $V$. (In the case of abstract sets
we will often take $V$ to be a two-element set and call the distinguished element
“true”.) Then a part $i$ of a set $A$ is said to have a map $\varphi : A \to V$ as characteristic
function iff the elements of $A$ that are members of $i$ are precisely those to which
$\varphi$ gives the value true. In symbols,

$$\forall\, T \xrightarrow{a} A\ [a \in i \iff \varphi a = v_1 T]$$

(here $v_1 T$ denotes the composite map $T \to 1 \xrightarrow{v_1} V$, which is constantly true).
That is, $i = \varphi^{-1}[v_1]$ is the inverse image of the one-element part “true” of $V$ along
$\varphi$. In the category of sets, every part $i$ has a unique characteristic function $\varphi$ (to
$V = 2$), and for any map $f$ the two ways of substituting $f$ agree:

$$\varphi_{f^{-1}[i]} = \varphi_i f$$
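The agreement of the two ways of substituting $f$ can be checked directly on finite sets. In this sketch, parts are modeled as Python sets and $V$ as the two truth values; the function names are illustrative.

```python
def characteristic_function(part):
    """Return the map sending a to True exactly when a is a member of part."""
    return lambda a: a in part


def inverse_image(f, part, domain):
    """Return the part of domain consisting of those t with f(t) in part."""
    return {t for t in domain if f(t) in part}
```

The characteristic function of the inverse image along `f` should then agree, at every point of the domain of `f`, with the characteristic function of the original part composed with `f`.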


Composition: 

Value of the Composite Mapping / Associative Law

$$\forall x\,[(gf)x = g(fx)]$$

[Diagram: a map $x$ with domain $T$, followed by the composable maps $f$ and $g$]

$T = 1$: elements/values
$T$ = arbitrary set: the associative law


Contrapositive : 

The contrapositive of a statement “P implies Q” is the statement that

“not Q implies not P” or in symbols: $(\neg Q \Rightarrow \neg P)$

If the original implication is true, then the contrapositive is always true also, and
in classical logic if the contrapositive is true, we can conclude that the original
statement was true (the latter procedure being often referred to as proof by
contradiction). For example: The statement that “1 is a separator in the category of
sets” in its contrapositive guise assumes the following form (which is sometimes
understood as the definition of equality for mappings):

$$[\forall\, 1 \xrightarrow{x} X\ [f_1 x = f_2 x]] \Rightarrow f_1 = f_2$$

Of course, to recognize the expression above as the contrapositive of the
separator condition involves one more ingredient beyond the notion of contrapositive
itself, namely, the recognition that existentially quantified statements are negated
as follows:

$$\neg \exists x\, P \equiv \forall x\, \neg P$$


Converse of an Implication : 

The converse of a statement of the form 

P implies Q (or in symbols: $P \Rightarrow Q$)

is the statement

Q implies P (or in symbols: $Q \Rightarrow P$)

Even when the original statement is true, the converse statement may or may not
be true, depending on the case.

Element (1.3):

• (in the narrow sense) An element of a set A is any mapping whose domain is 
1 and whose codomain is A. This mapping is also sometimes referred to as the 
indication of an element, but we will not distinguish between elements and their 
indication. 

• (in the generalized sense) An element of A is just another word for any mapping 
with codomain A; such a generalized element may also be referred to as a 
variable element or as a parameterization or listing of elements. 

Epimorphism / Right Cancellation (4.5): 

This concept is strictly “dual” to that of monomorphism. We say $f$ is an
epimorphism iff it satisfies the right-cancellation property

$$\forall \varphi_1, \varphi_2\ [\varphi_1 f = \varphi_2 f \Rightarrow \varphi_1 = \varphi_2]$$


Foundation: 

A foundation makes explicit the essential general features, ingredients, and
operations of a science as well as its origins and general laws of development. The
purpose of making these explicit is to provide a guide to the learning, use, and
further development of the science. A “pure” foundation that forgets this purpose
and pursues a speculative “foundation” for its own sake is clearly a nonfoundation.

Foundation, Category of Categories as: 

Among the general features of mathematics a foundation should make explicit are 

(1) the nature and workings of the logical and algebraic theories within which we 
reason and calculate; 

(2) the nature and workings of the set-universes, whether abstract and constant, 
or cohesive and variable, that we imagine as an objective or geometrical back- 
ground; 

(3) the nature and workings of the interpretations , known as structures or models, 
of theories into set-universes. 

These requirements have forced mathematics toward the foundational view we 
have tried to convey in this glossary and in this book; this can be seen in the following 
way: The “category” of models of a given theory in a given set-universe must form
something like a mathematical category since these structures are different
realizations of the one single system of features required by the theory, and hence there must
be ways of comparing different ones by “morphisms” or maps that “preserve” those
features. Something of that sort applies roughly to concepts generally, but in
mathematics it is quite definite. For example, consider a theory $A$ of addition and
multiplication, which in an appropriate background universe $U$ has many models such as

$Q(1)$ = a system of constant quantities;
$Q(S)$ = a system of quantities varying over a region $S$ of space; and
$Q(T)$ = a system of quantities varying over an interval $T$ of time.

Then a specified motion $m : T \to S$ will yield a morphism that we name
$m^* : Q(S) \to Q(T)$, and a specified instant $t : 1 \to T$ of time will yield a morphism
$t^* : Q(T) \to Q(1)$, such morphisms being structure-preserving maps for the
theory $A$. The background universe $U$ must also be a category since morphisms
of spaces (or sets) are the ingredients in the morphisms like $t^*$, $m^*$ of structures.
A key discovery around 1962 (foreshadowed by the algebraic logic of the 1950s)
was that a theory $A$ is also a category! That is because the most basic operation
of calculation and reasoning (which as it turned out uniquely determines the other
operations) is substitution, and substitution correctly objectified is composition.
(The symbolic schemes often called “theories” are a necessary but not uniquely
determined apparatus for presenting theories [in the usual algebraic sense of
presentation].) It seems that two things (like $A$ and $U$) cannot be concretely related
unless they can be construed as objects of the same category; fortunately, in this
case we have that both $A$ and $U$ are categories, and so a relation like $Q : A \to U$
is a functor in a category of categories. Moreover, the morphisms between different
models are just natural transformations between functors. These considerations
put a mathematical focus on some questions that a foundation needs to address.

Function : 

Although the word “function” is sometimes used in a way synonymous with “mapping,”
more frequently it is used to describe maps (in some category) whose
codomains are some special objects $V$ deemed to have a “quantitative” character
(for example $V = 2$ or $V = \Omega$), and so one speaks of, for example, “the algebra
of smooth complex-valued functions” on any smooth space. The $V$-valued functions
on any object will always form an “algebra” (in a general sense) because the
maps $V \to V$, $V \times V \to V$, and so on, will act as operations on them.

Functor (10.18): 

Let $\mathcal{A}$ and $\mathcal{B}$ be categories. A (covariant) functor $F$ from $\mathcal{A}$ to $\mathcal{B}$, denoted
$F : \mathcal{A} \to \mathcal{B}$, is an assignment of

• an object $F(A)$ in $\mathcal{B}$ for every object $A$ in $\mathcal{A}$

• an arrow $F(f) : F(A) \to F(A')$ in $\mathcal{B}$ for every arrow $f : A \to A'$ in $\mathcal{A}$

subject to the following equations:

• $F(1_A) = 1_{F(A)}$

• $F(gf) = F(g)F(f)$ whenever $A \xrightarrow{f} A' \xrightarrow{g} A''$

A contravariant functor $F : \mathcal{A} \to \mathcal{B}$ assigns objects to objects, and an arrow
$F(f) : F(A') \to F(A)$ (note direction!) in $\mathcal{B}$ for every arrow $f : A \to A'$ in $\mathcal{A}$. It
satisfies the equation $F(gf) = F(f)F(g)$ instead of the second equation above.
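
The two functor equations can be checked mechanically on small finite sets. The following is only a sketch under stated assumptions: finite sets are modeled as Python sets, mappings as dicts, the covariant example is the power-set functor acting by direct image, and the contravariant example is the same assignment on objects acting by inverse image. All function names are hypothetical and chosen for this illustration.

```python
from itertools import product

def compose(g, f):
    """Composite gf (g following f) of finite mappings stored as dicts."""
    return {x: g[f[x]] for x in f}

def identity(A):
    """Identity mapping 1_A on a finite collection A."""
    return {x: x for x in A}

def subsets(A):
    """All subsets of the finite set A, as frozensets."""
    items = sorted(A)
    return [frozenset(x for x, keep in zip(items, bits) if keep)
            for bits in product([False, True], repeat=len(items))]

def direct_image(f, A):
    """Covariant action: sends a subset S of A to {f(x) : x in S}."""
    return {S: frozenset(f[x] for x in S) for S in subsets(A)}

def inverse_image(f, B):
    """Contravariant action: sends a subset S of B to {x : f(x) in S}."""
    return {S: frozenset(x for x in f if f[x] in S) for S in subsets(B)}

A, A1, A2 = {0, 1, 2}, {"a", "b"}, {"p", "q", "r"}
f = {0: "a", 1: "a", 2: "b"}    # f : A -> A1
g = {"a": "p", "b": "r"}        # g : A1 -> A2

# Covariant laws: F(1_A) = 1_{F(A)} and F(gf) = F(g)F(f).
cov_identity_ok = direct_image(identity(A), A) == identity(subsets(A))
cov_composite_ok = (direct_image(compose(g, f), A)
                    == compose(direct_image(g, A1), direct_image(f, A)))

# Contravariant law: F(gf) = F(f)F(g), with the order of composition reversed.
contra_composite_ok = (inverse_image(compose(g, f), A2)
                       == compose(inverse_image(f, A1), inverse_image(g, A2)))
```

All three flags evaluate to `True` for this choice of `f` and `g`; a check on one example is of course an illustration, not a proof.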


Inclusion (2.13): 

The notation

$$i' \subseteq_A i$$

means both $i$, $i'$ are parts of a set $A$ and

$$\exists k\ [i' = ik]$$

which is equivalent to saying

$$\forall a\ [a \in i' \Rightarrow a \in i]$$

When the set $A$ is understood, we write simply $i' \subseteq i$.


Intersection (2.36): 

If $i_1$, $i_2$ are parts of $A$, then their intersection, denoted

$$i_1 \cap i_2$$

is also a part $j$ of $A$ characterized up to equivalence by having as members precisely
those elements of $A$ common to $i_1$ and $i_2$:

$$\forall\, T \xrightarrow{a} A\ [a \in j \iff a \in i_1 \ \&\ a \in i_2] \iff j \equiv i_1 \cap i_2$$

Among the many ways that intersection can be characterized is as the “inverse
image” along $i_1$ of $i_2$.


Inverse Image (2.5): 

If $A \xrightarrow{f} B$ is any mapping and $j$ is any part of $B$, the condition “$f \in j$” is typically
not outright true but does have a “solution set”: the part of $A$ for which $f \in j$ holds,
more exactly the part $i$ of $A$ whose members are precisely all those elements $a$ of
$A$ for which $fa \in j$. In symbols:

$$\forall\, T \xrightarrow{a} A\ [a \in i \iff fa \in j] \iff i \equiv f^{-1}[j]$$

We call $f^{-1}[j]$ the inverse image of $j$ along $f$.
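
For finite sets the “solution set” reading can be computed directly. The sketch below assumes a mapping is stored as a Python dict and a part is represented by the set of its members; the name `inverse_image` is a hypothetical helper, not notation from the text.

```python
def inverse_image(f, j):
    """Members of f^{-1}[j]: all elements a of the domain of f with f a in j."""
    return {a for a in f if f[a] in j}

f = {1: "x", 2: "y", 3: "x", 4: "z"}   # a mapping f : A -> B with A = {1, 2, 3, 4}
j = {"x", "z"}                          # members of a part j of B
i = inverse_image(f, j)

# Intersection as an inverse image: pull the members of i2 back along the
# inclusion mapping i1 of another part of A.
i1 = {a: a for a in {1, 2, 3}}          # inclusion of the part with members 1, 2, 3
i2_members = {2, 3, 4}
overlap = inverse_image(i1, i2_members)
```

Here `i` is `{1, 3, 4}` and `overlap` is `{2, 3}`, the members common to both parts.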

Injective / Monomap / Left-Cancellation / Part:

All four of these terms mean essentially the same thing but with different gram- 
matical shadings as in “A huge person is a giant.” 


Injective (1.5): 

Given an object $I$, we may say that $f$ is $I$-injective iff $f$ has the left-cancellation
property for all test pairs with domain $I$:

$$\forall a_1, a_2\ \left[ I \underset{a_2}{\overset{a_1}{\rightrightarrows}} A \ \&\ fa_1 = fa_2 \Rightarrow a_1 = a_2 \right]$$

Since the definition of “monomorphism” contains the phrase “$\forall T$,” it is obvious that
any monomorphism is always $I$-injective. The converse proposition, namely that
$I$-injective $\Rightarrow$ monomorphism, holds if $I$ is a separator, as is easy to prove. In the
case of the category of abstract sets, we usually take $I = 1$ and refer to 1-injective
mappings simply as injective. Thus injective mappings $f$ are usually considered
“by definition” to satisfy

$$fa_1 = fa_2 \Rightarrow a_1 = a_2$$

where $a_1$, $a_2$ are elements (in the narrow sense) of the domain of $f$; however, since
1 is a separator, injective mappings in fact satisfy the full left-cancellation property.
In contrapositive form,

$$\forall a_1, a_2\ [a_1 \neq a_2 \Rightarrow fa_1 \neq fa_2]$$

describes injective/monic $f$; thus to show that a certain $f$ is not injective, it suffices
to exhibit that

$$\exists a_1, a_2\ [fa_1 = fa_2 \ \&\ a_1 \neq a_2]$$
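
The relation between 1-injectivity and the full left-cancellation property can be tested by brute force on small finite sets. This is a minimal sketch, assuming mappings are Python dicts; `is_injective`, `non_injectivity_witness`, and `has_left_cancellation` are hypothetical names introduced only for this illustration.

```python
from itertools import product

def is_injective(f):
    """1-injective: f a1 = f a2 implies a1 = a2 for elements a1, a2 of the domain."""
    return all(a1 == a2 for a1 in f for a2 in f if f[a1] == f[a2])

def non_injectivity_witness(f):
    """A pair (a1, a2) with f a1 = f a2 and a1 != a2, or None if f is injective."""
    for a1 in f:
        for a2 in f:
            if a1 != a2 and f[a1] == f[a2]:
                return (a1, a2)
    return None

def has_left_cancellation(f, T):
    """Check f a1 = f a2 => a1 = a2 for all test pairs a1, a2 : T -> A."""
    A, T = sorted(f), sorted(T)
    maps = [dict(zip(T, values)) for values in product(A, repeat=len(T))]
    return all(a1 == a2
               for a1 in maps for a2 in maps
               if all(f[a1[t]] == f[a2[t]] for t in T))

f_inj = {0: "a", 1: "b", 2: "c"}
f_not = {0: "a", 1: "a", 2: "b"}
T = {"s", "t"}                      # a two-element test domain

inj_results = (is_injective(f_inj), has_left_cancellation(f_inj, T))
not_results = (is_injective(f_not), has_left_cancellation(f_not, T))
witness = non_injectivity_witness(f_not)
```

In this example the two checks agree for each mapping, as the separator argument predicts, and `witness` is the pair `(0, 1)`.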


Left-Cancellation / Monomapping (2.6):

The statement $f$ has the left-cancellation property means that for any $a_1$ and $a_2$
for which $fa_1 = fa_2$ we can conclude $a_1 = a_2$, or in symbols

$$\forall T\ \forall a_1\ \forall a_2\ [fa_1 = fa_2 \Rightarrow a_1 = a_2]$$

as in the diagram

$$T \underset{a_2}{\overset{a_1}{\rightrightarrows}} A \xrightarrow{\ f\ } B$$

If $f$ has the left-cancellation property in a certain category, we say that $f$ is a
monomorphism in that category. Since the morphisms in the category of sets
are called mappings, the monomorphisms in that category may also be called
monomappings.


Logic : 

The science of logic in the ancient philosophical sense means the study of the general
laws of the development of thinking. Thinking (1) reflects reality (i.e., has a content)
but also (2) is itself part of reality and so has some motions that are oblivious to
content. Therefore the science of logic finds two aspects of thought’s motion: (1) the
struggle to form a conceptual image of reality that is ever more refined, whose laws
we may call objective logic, and (2) the motion of thought in itself (for example the
inference of statements from statements), whose laws we may call subjective logic.
Although grammar and some aspects of algebra might be considered as subjective
logic, we will limit ourselves to the part we will sometimes refer to as logic in the
narrow sense - that which is related to the inference of statements from statements
by means dependent on their form rather than on their content. (Logic in the narrow
sense is explained in more detail in Appendix A.)

Logic in the narrow sense is useful (at least in mathematics) if it is made explicit,
and the work of Boole and Grassmann in the 1840s, Schroder in the late 1800s,
Skolem in the early 1900s, Heyting in the 1930s (and of many others) has led to a
high development, most aspects of which were revealed to be special cases of adjoint
functors by 1970 [La69b]. The use of adjoint functors assists in reincorporating the
subjective into its rightful place as a part of the objective so that it can organically
reflect the objective and in general facilitate the mutual transformation of these two
aspects of logic.


Logic, Objective : 

The long chains of correct reasonings and calculations of which subjective logic is
justly proud are only possible within a precisely defined universe of discourse, as
has long been recognized. Since there are many such universes of discourse,
thinking necessarily involves many transformations between universes of discourse as
well as transformations of one universe of discourse into another. The results of
applying logic in the narrow sense to the laws of these objective transformations are
necessarily inadequate; for example, such attempts have led to the use of phrases
such as “let X be a set in which there exists a group structure,” which are essentially
meaningless. Rather than using “there exists” in such contexts, one needs instead a
logic of “given.” Before category theory, at least one systematic discussion of the
laws of these objective transformations was given by Bourbaki, who discussed how
one structure could be deduced from another. The concepts of categories,
functors, homomorphisms, adjoint functors, and so on, provide a rich beginning to the
project of making objective logic explicit, but there is probably much more to be
discovered.


Logic, Intuitionistic : 

A. Heyting, in the 1930s, developed a logical algebra that happens to be applicable 
in any topos. This logic is weaker than the Boolean one because it allows for 
the possibility of motion or internal variation in the sets or universes to which it 
is being applied, and thus the typical Boolean inference (or subobject inclusion) 
“not(not A) entails A” is usually not correct. Because Heyting’s work was intended 
to model the “constructivist” philosophy of the intuitionist Brouwer, Heyting’s 
logic is sometimes incorrectly referred to as intuitionistic logic, and for that reason 
topos theory is sometimes referred to as constructive set theory; neither of these 
does justice to the true generality of Heyting’s discovery or to Grothendieck’s 
theory. There are a few toposes that have been specifically constructed in an attempt
to understand the constructivist philosophy, but most toposes have little to do 
with it. 


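Mapping (1.1):

In the category of abstract sets, $f$ is a mapping with domain $A$ and codomain $B$
if for every element $x$ of $A$ there is exactly one element $y$ of $B$ such that $y$ is the
value of $f$ at $x$. In symbols,

$$\forall\, 1 \xrightarrow{x} A\ \ \exists!\, 1 \xrightarrow{y} B\ [y = fx]$$

and as a diagram

$$\begin{array}{ccc} & 1 & \\ {\scriptstyle x}\swarrow & & \searrow{\scriptstyle y} \\ A & \xrightarrow{\ f\ } & B \end{array}$$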

Membership (2.15):

The notation $a \in_A i$ means $i$ is a part of a set $A$ and $\exists \bar{a}\ [a = i\bar{a}]$ as in the diagram

$$\begin{array}{ccc} T & \xrightarrow{\ \bar{a}\ } & U \\ & {\scriptstyle a}\searrow & \downarrow{\scriptstyle i} \\ & & A \end{array}$$

When, as is usually the case, the set $A$ is understood, we write simply $a \in i$.


Natural Transformation (10.19): 

Let $F, G : \mathcal{A} \to \mathcal{B}$ be functors. A natural transformation $\tau$ from $F$ to $G$ is the
assignment of an arrow $\tau_A : FA \to GA$ in $\mathcal{B}$ for each object $A$ of $\mathcal{A}$ subject to the
requirement that for each arrow $a : A_1 \to A_2$ in $\mathcal{A}$ the following square commutes:

$$\begin{array}{ccc} FA_1 & \xrightarrow{\ \tau_{A_1}\ } & GA_1 \\ {\scriptstyle Fa}\downarrow & & \downarrow{\scriptstyle Ga} \\ FA_2 & \xrightarrow{\ \tau_{A_2}\ } & GA_2 \end{array}$$

The arrows $\tau_A$ are called the components of the natural transformation. In case
each $\tau_A$ is an isomorphism in $\mathcal{B}$, then $\tau$ is called a natural isomorphism; in that
case there is an inverse natural transformation.

There are many examples in the text and exercises - notably the natural map
treated in Section 8.3, the singleton (see Exercise 8.9), and the arrows in the
categories of $M$-sets in Section 10.2.
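
The singleton example can be checked on finite sets. The sketch below rests on assumptions not spelled out in the glossary: $F$ is taken to be the identity functor on finite sets, $G$ the covariant power-set functor acting by direct image, and the component at $A$ sends each element to its one-member subset. The function names are hypothetical.

```python
def singleton(A):
    """Component at A: sends each element x of A to the one-member subset {x}."""
    return {x: frozenset([x]) for x in A}

def direct_image(a, S):
    """Action of the covariant power-set functor on a map a, applied to a subset S."""
    return frozenset(a[x] for x in S)

A1, A2 = {0, 1, 2}, {"p", "q"}
a = {0: "p", 1: "p", 2: "q"}            # an arrow a : A1 -> A2

tau_A1, tau_A2 = singleton(A1), singleton(A2)

# The two paths around the naturality square (F is the identity, so Fa = a).
via_Fa = {x: tau_A2[a[x]] for x in A1}                  # tau_{A2} following Fa
via_Ga = {x: direct_image(a, tau_A1[x]) for x in A1}    # Ga following tau_{A1}

square_commutes = via_Fa == via_Ga
```

Both paths send an element `x` to the subset whose only member is `a[x]`, so the square commutes for this arrow.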


One-Element Set (Section 1.2): 

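The set 1 is characterized by the fact that for any set $A$ there is exactly one mapping
with domain $A$ and codomain 1. In symbols,

$$\forall A\ \exists!\ [A \to 1]$$

that is, the statements below are both true for all $A$:

EXISTENCE ($\exists$):     $\exists f\ [A \xrightarrow{f} 1]$

AT MOST ONE ($!$):    $\forall f, g\ \left[ A \underset{g}{\overset{f}{\rightrightarrows}} 1 \Rightarrow f = g \right]$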
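
For finite sets, the characterization can be confirmed by enumerating mappings. This is a sketch under the assumption that a mapping is a Python dict and that the one-element set is modeled by `{"*"}`; `all_maps` and `ONE` are hypothetical names.

```python
from itertools import product

def all_maps(A, B):
    """Every mapping A -> B between finite sets, each as a dict."""
    A = sorted(A)
    return [dict(zip(A, values)) for values in product(sorted(B), repeat=len(A))]

ONE = {"*"}   # a model of the one-element set 1

# Exactly one mapping A -> 1 for sets A with 0, 1, 2, 3 elements.
maps_into_one = {n: len(all_maps(set(range(n)), ONE)) for n in range(4)}

# Elements in the narrow sense: mappings 1 -> A correspond to members of A.
A = {"u", "v", "w"}
elements_of_A = [x["*"] for x in all_maps(ONE, A)]
```

Every count in `maps_into_one` is 1 (including the empty domain, where the only mapping is the empty one), and `elements_of_A` lists each member of `A` exactly once.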

Part (2.11):

By a part of a set $A$ is meant any monomapping $i$ whose codomain is $A$. Notation:
$U \overset{i}{\hookrightarrow} A$. Parts of $A$ are called equivalent if they are isomorphic as objects
of the slice category $\mathcal{S}/A$.


Separator (1.14): 

A set $T$ is a separator if there are enough mappings with it as domain to separate
pairs of mappings with arbitrary domains; i.e., whenever $A$ is an arbitrary set and $f_1$
and $f_2$ are any two mappings with domain $A$ and some common codomain $B$, if $f_1$
is distinct from $f_2$ there should exist a mapping $x$ with domain $T$ and codomain $A$
such that $f_1 x$ is distinct from $f_2 x$. An important property of the category of abstract
sets is that the terminal set 1 is a separator. In symbols,

$$\forall A\ \forall B\ \forall f_1, f_2\ \left[ \left[ A \underset{f_2}{\overset{f_1}{\rightrightarrows}} B \ \&\ f_1 \neq f_2 \right] \Rightarrow \exists\, 1 \xrightarrow{x} A\ [f_1 x \neq f_2 x] \right]$$

as in the diagram:

$$1 \xrightarrow{\ x\ } A \underset{f_2}{\overset{f_1}{\rightrightarrows}} B$$
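
The statement that 1 separates can be checked exhaustively for small finite sets. The following is a sketch assuming mappings are Python dicts and elements of $A$ are simply members of the dict's key set; `all_maps` and `separating_element` are hypothetical helper names.

```python
from itertools import product

def all_maps(A, B):
    """Every mapping A -> B between finite sets, each as a dict."""
    A = sorted(A)
    return [dict(zip(A, values)) for values in product(sorted(B), repeat=len(A))]

def separating_element(f1, f2):
    """An element x of the common domain with f1 x != f2 x, or None if there is none."""
    for x in f1:
        if f1[x] != f2[x]:
            return x
    return None

A, B = {0, 1, 2}, {"u", "v"}
maps = all_maps(A, B)

# 1 separates: parallel maps are distinct exactly when some element tells them apart.
one_separates = all(
    (f1 != f2) == (separating_element(f1, f2) is not None)
    for f1 in maps for f2 in maps
)
```

The check runs over all 64 pairs of the 8 mappings from `A` to `B`.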



Set Theory: 

Set theory was started in the late 1800s when Cantor made explicit an abstraction 
process (arising from his own work on Fourier analysis) and applied the idea of 
isomorphism (which he had learned from the work of Steiner on algebraic geometry) 
to the results of this abstraction process. Some of Cantor’s followers did not fully 
appreciate this abstraction process (in editing Cantor’s works for publication they 
issued judgments of it such as “kein glücklicher Versuch” and “um keinen Schritt
weitergekommen,” p. 35, edition of 1932 [C66]). For this and other reasons, too 
much of the technical development of set theory has been rather detached from its 
origins in the specific requirements of functional analysis and algebraic geometry. 
Thus, the great functional analyst and algebraic geometer of the 1950s and 1960s, 
Grothendieck, made only peripheral reference to set theory as known at the time 
when he devised topos theory, an explicit theory of the categories of cohesive and 
variable sets as they actually occur in geometry and analysis [AGV72]. In 1970 the 
essential part of Grothendieck’s theory was drastically simplified in response to the
needs of continuum mechanics and algebraic topology [La71], [T72] (see [Joh77],
[BW85], [MM92]). These developments made the bridging of the previous gap
between axiomatic set theory and naive set theory possible.

Set Theory, Axiomatic : 

The use of the axiomatic method to make explicit the basic general features of
the application of set theory to the study of mathematical subjects was delayed for 
much of the twentieth century by the attempt to popularize a certain philosophical 
view; according to that view the intersection of any two unrelated sets should have 
a well-defined meaning. That contrasts with the phenomenon noted in practice 
that inclusions, and more generally, relations between sets, are effected by specific 
mappings. The description of this “cumulative hierarchy” came to be considered 
by some as the only possible sort of formalized set theory. 

Set Theory, Naive: 

A certain body of set-theoretical methods and results is required for the learning 
and development of analysis, geometry, and algebra. Because of the insufficiently 
analyzed belief that this body could not be made logically precise without the 
cumulative hierarchy and global inclusion, it became customary to describe it in a
“nonformalized” manner known as “naive set theory” [H60]. One of the purposes
of this book is to overcome this division between naive and axiomatic aspects by 
giving formal axioms sufficient to describe the applicable aspects of the set theory 
that has been developed by Dedekind, Hausdorff and their many successors. For an
earlier version, see [La64].

Set Theory, Parameterization : 

Many of the applications of naive set theory could be described as based on
“parameterization”: a set is used to parameterize some things, some mappings on the set
are deemed to express relations among the things, calculation and reasoning are
applied to these maps, and the result is used to guide our dealing with the original
things. Since in particular set theory should be partly applicable to itself, several of
the axioms can be motivated by the observation that they simply assert that there are
enough sets to parameterize some basic set-theoretic “things,” as in the following
three cases.

A set $T$ can parameterize

(1) elements of a set $A$, by means of a map $T \to A$; a bijective parameterization
can be achieved by taking $T = A$ and $1_A$ as the map;

(2) pairs of elements of $A$, $B$ by means of a pair of maps $T \to A$, $T \to B$; here
a bijective parameterization has $T = A \times B$ and the projections as the maps;

(3) maps $A \to B$ by means of a map $T \times A \to B$; a bijective parameterization
is possible if we can take $T = B^A$ and evaluation as the map.
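
Case (3) can be made concrete for finite sets. In the sketch below, which is an illustration under assumptions rather than the book's construction, a map $A \to B$ is stored as a sorted tuple of (argument, value) pairs so that it can serve as a parameter, and a map $T \times A \to B$ is a dict keyed by pairs. The names `all_maps`, `evaluation`, and `named_maps` are hypothetical.

```python
from itertools import product

def all_maps(A, B):
    """Every mapping A -> B between finite sets, each as a dict."""
    A = sorted(A)
    return [dict(zip(A, values)) for values in product(sorted(B), repeat=len(A))]

def named_maps(h, T, A):
    """For h : T x A -> B, the map A -> B named by each parameter t in T."""
    return {t: tuple(sorted((a, h[(t, a)]) for a in A)) for t in T}

A, B = {0, 1}, {"u", "v", "w"}

# The mapping set B^A: each member is a hashable record of one map A -> B.
B_to_the_A = [tuple(sorted(m.items())) for m in all_maps(A, B)]

def evaluation(g, a):
    """The evaluation map B^A x A -> B."""
    return dict(g)[a]

# Take T = B^A and h = evaluation: every map A -> B is named exactly once.
h = {(g, a): evaluation(g, a) for g in B_to_the_A for a in A}
names = named_maps(h, B_to_the_A, A)
bijective = (all(names[g] == g for g in B_to_the_A)
             and len(set(names.values())) == len(B_to_the_A))
```

With `T` equal to the nine-member mapping set and `h` equal to evaluation, each parameter names itself, which is what makes this parameterization bijective.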

Such parameterizations are the subject of the related ideas of universal mapping 
property, representable functors, and adjoint functors developed by Grothendieck, 
Kan, and Yoneda in the late 1950s. 

(Sets [or spaces] themselves are $T$-parameterized in geometry by using a map
$E \to T$ with the fibers $E_t$ being the sets parameterized; Cantor’s diagonal argument
showed that no single $T$ can parameterize all sets.)

Surjective (1.4): 

This concept is not logically dual to injective since it is an existential condition rather
than one of cancellation. Hence, the theorem that for some categories “surjective”
and “epimorphic” coincide will tell us something special about those categories.
The exact notion of “surjective” is related to a chosen object $I$ (such as $I = 1$ in
the case of abstract sets). We say $f$ is $I$-surjective iff

$$\forall y\ \exists x\ [fx = y]$$

where $x$, $y$ are both supposed to have domain $I$ but respective codomains $A$, $B$.
That is, every $I$-element of the codomain is required to be a value of $f$ (for at least
one $x$).

To show that epimorphism $\Rightarrow$ 1-surjective in the case of abstract sets, suppose
that $f$ has the right-cancellation property but that (aiming toward a proof by
contradiction) there is a $y$ that does not appear as a value of $f$. But then if we take
as $\varphi_1$ the characteristic function of $y$ and as $\varphi_2$ the constantly “false” map $B \to 2$, it
follows that $\varphi_1 f = \varphi_2 f$ but $\varphi_1 \neq \varphi_2$, contradicting that $f$ has right cancellation.
Hence, there is no such $y$, that is, $f$ is surjective.
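
The pair $\varphi_1$, $\varphi_2$ used in this argument can be built explicitly for a finite mapping that misses a value. The sketch assumes mappings are Python dicts and models the set 2 by the Python booleans; `right_cancellation_counterexample` is a hypothetical name.

```python
def compose(g, f):
    """Composite gf of finite mappings stored as dicts."""
    return {x: g[f[x]] for x in f}

def right_cancellation_counterexample(f, B):
    """If some y in B is not a value of f, return (phi1, phi2) : B -> 2 with
    phi1 f = phi2 f but phi1 != phi2; return None when f is surjective."""
    missed = sorted(set(B) - set(f.values()))
    if not missed:
        return None
    y = missed[0]
    phi1 = {b: b == y for b in B}     # characteristic function of y
    phi2 = {b: False for b in B}      # the constantly "false" map
    return phi1, phi2

B = {"a", "b", "c"}
f = {0: "a", 1: "b"}                  # f : {0, 1} -> B misses the value "c"
phi1, phi2 = right_cancellation_counterexample(f, B)

agree_after_f = compose(phi1, f) == compose(phi2, f)
differ = phi1 != phi2
```

Here `agree_after_f` and `differ` are both `True`, so this `f` fails right cancellation, while a surjective mapping onto `B` yields `None`.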

Topos : 

Since 1970 [La71], the geometrical constructions of Grothendieck in SGA 4
[AGV72] have been unified with the constructions of the set theorists Cohen, Scott and
Solovay [B77] by the observation that both are largely concerned with categories
$\mathcal{X}$ that

(1) have finite products and in which, moreover, there exist

(2) for each object $A$, a right adjoint $(\ )^A$ to the functor $A \times (\ )$, and

(3) a map $1 \xrightarrow{\text{true}} \Omega$, which is a universal part in the sense that for any part
$\alpha : A \hookrightarrow X$ of any object in $\mathcal{X}$, there is a unique $X \xrightarrow{\varphi} \Omega$ such that

$$\begin{array}{ccc} A & \longrightarrow & 1 \\ {\scriptstyle \alpha}\downarrow & & \downarrow{\scriptstyle \text{true}} \\ X & \xrightarrow{\ \varphi\ } & \Omega \end{array}$$

is a pullback; i.e. for any $T \xrightarrow{x} X$

$$x \text{ belongs to } \alpha \iff \varphi x = \text{true}_T$$

This is sometimes called the “elementary” theory of toposes since the three axioms
are finitary and internal to $\mathcal{X}$.


Topos and the Cantorian Contrast : 

The cohesive and variable sets (“spaces”) are persistent ideas collectively developed
as part of our coping with the world of matter in motion of which we are an
interacting part. Cantor and subsequent mathematicians have found it useful to
apply something like the ancient Greek ideas of arithmos and chaos by analyzing
this variation and cohesion through contrasting it with constancy and noncohesion
$\mathcal{S}$, wherein objects are distinct and fixed enough to have a property like number
but nearly devoid of any other property. (Cantor further conjectured, in effect, that
if “nearly devoid” could be idealized to “totally devoid” for a topos $\mathcal{S}$ of abstract
sets, then $\mathcal{S}$ would satisfy some strong special properties such as the continuum
hypothesis - a conjecture apparently proved in the 1930s by Gödel, although that
interpretation is still not widely accepted.) Mathematical practice made clear that
cohesion and variation occur in many diverse but related categories $\mathcal{X}$. Construing
some of those categories as toposes makes it possible to study the contrast arising
in Cantor’s abstraction process as a geometric morphism $\mathcal{X} \xrightarrow{\Gamma} \mathcal{S}$. This process
$\Gamma_*$ of extraction of pure points unites two opposite inclusions of $\mathcal{S}$ into $\mathcal{X}$: the
subcategory $\Gamma^*$ of discrete spaces with zero cohesion and zero motion and the
subcategory $\Gamma^!$ of codiscrete spaces with total (but banal) cohesion and motion.
These opposite but identical subcategories provide a zeroth approximation to the
reconstruction of any space $X$ by placing it canonically in an interval

$$\Gamma^*\Gamma_* X \to X \to \Gamma^!\Gamma_* X$$

(In fact closer approximations to $X$ involving narrower intervals are often obtainable
by geometric morphisms involving a less abstract topos $\mathcal{S}'$, but we do not discuss that
here.) There are no nonconstant $\mathcal{X}$-maps from a codiscrete space to a discrete space;
this sharply separates the two aspects (of an $\mathcal{S}$-object) that the elements are totally
distinguished and yet have total lack of distinguishing properties; these aspects are
thus revealed to form not a conceptual inconsistency but a productive dialectical
contradiction.

Topos, Geometric Morphism : 

A pair of adjoint functors

$$\mathcal{X} \underset{\Gamma_*}{\overset{\Gamma^*}{\leftrightarrows}} \mathcal{S}$$

with $\Gamma^*$ left adjoint to $\Gamma_*$ is called a geometric morphism $\mathcal{X} \xrightarrow{\Gamma} \mathcal{S}$ in case $\Gamma^*$
preserves finite limits. The latter “exactness” property will automatically be true
in case there is a further left adjoint $\Gamma_!$ to $\Gamma^*$. (Note that this string $\Gamma_! \dashv \Gamma^* \dashv \Gamma_*$
is in objective analogy with the string $\exists \dashv \text{substitution} \dashv \forall$ of logic in the narrow
sense.) When $\Gamma_!$ exists, the morphism $\Gamma$ is often called “essential”. Grothendieck’s
compact notation is: asterisks for functors that are present for every $\Gamma$, exclamation
points for functors that are present only for notably special $\Gamma$, whereas the lower
position (for either) denotes functors deemed to have the same direction as $\Gamma$ and
the upper position denotes functors having the opposite direction to $\Gamma$. In case a
further right adjoint $\Gamma^!$ to $\Gamma_*$ exists, a special geometric morphism $\Phi$ results in
the opposite direction with $\Phi^* = \Gamma_*$ and $\Phi_* = \Gamma^!$; such a $\Phi$ may be called “flat”
because the left adjoint $\Phi_! (= \Gamma^*)$ not only exists but preserves finite limits.

All four functors exist in the case where $\mathcal{X}$ is the category of reflexive graphs in a
topos $\mathcal{S}$ (see Section 10.3), where $\Gamma : \mathcal{X} \to \mathcal{S}$ is determined by $\Gamma^*$ = the inclusion
of trivial graphs.

Topos of Abstract Sets : 

For a topos U the conditions that 

(1) epimorphisms have sections (axiom of choice), and 

(2) there are (up to isomorphism) exactly two parts of 1, 

are generally sufficient to exclude most traces of cohesion or variation, as Cantor 
proposed. (It follows that $2 = \Omega$ and that 1 separates.)

Occasionally the (provably stronger) Generalized Continuum Hypothesis of 
Cantor may be further imposed: 

(GCH) Whenever monomorphisms $A \to X \to 2^A$ exist, then an isomorphism
$A \cong X$ or $X \cong 2^A$ also exists.

Usually a topos of abstract sets is assumed to satisfy (1) and (2) and also 
Dedekind’s axiom concerning the existence of a set of natural numbers: 

(3) The forgetful functor $\mathcal{U}' \to \mathcal{U}$ has a left adjoint, where $\mathcal{U}'$ is the category
whose objects are the objects of $\mathcal{U}$ equipped with an action by an arbitrary
endomap.


Topos, Grothendieck : 

For the development of number theory, algebraic geometry, complex analysis, and
of the cohomology theories relevant thereto, it was found necessary in 1963 to
introduce explicitly certain categories that Grothendieck and Giraud called
“$\mathcal{U}$-toposes.” (“Topos” is a Greek term intended to describe the objects studied by
“analysis situs,” the Latin term previously established by Poincaré to signify the
science of place [or situation]; in keeping with those ideas, a $\mathcal{U}$-topos was shown
to have presentations in various “sites”.)

In modern terms a $\mathcal{U}$-topos is a topos $\mathcal{X}$ equipped with a geometric morphism $\Gamma$
to a topos $\mathcal{U}$ of abstract sets satisfying the condition that $\mathcal{X}$ is boundedly generated
over $\mathcal{U}$ in the sense that some single map $B \to \Gamma^* I$ generates $\mathcal{X}$. In the cases
where one can choose $I = 1$, the generation condition means that the map

$$\mathcal{X}(B, X) \cdot B \to X$$

is an epimorphism for all $X$, where by definition

$$\mathcal{X}(B, X) = \Gamma_*(X^B)$$

$$U \cdot B = (\Gamma^* U) \times B$$

The functor $\Gamma_* = \mathcal{X}(1, -)$ is sometimes called the “global sections” functor to
remind us that the objects of $\mathcal{X}$ may be spatially more variable than those of $\mathcal{U}$, or
the “fixed points” functor to remind us that the objects of $\mathcal{X}$ may be dynamically
more active, or the “underlying discrete set” functor to remind us that they may be
more cohesive than those of $\mathcal{U}$.

As an application of Grothendieck’s powerful method of relativization, it is very
useful to consider $\mathcal{U}$-toposes also in the more general case where $\mathcal{U}$ itself is an arbitrary
topos not necessarily consisting of purely abstract sets.


Union : 

The hierarchical philosophy led to the identification of the notions of set and of
subset (part). This in turn led to a confusion between sums (of sets) and unions (of parts
of a given set); for example, the editors of Cantor’s collected works [C66]
complained that he had defined the sum only in the case of disjoint sets. If $A_i \hookrightarrow X$
is a given family of parts (for example a family consisting of only two parts), then
the union of the family is a part of $X$ obtained by (epi-mono) image factorization
of the induced map from the sum

$$A = \sum_i A_i \twoheadrightarrow \bigcup_i A_i \hookrightarrow X$$

In toposes, any epimorphism is a coequalizer of the induced equivalence relation
given by the self-pullback $A \times_X A \rightrightarrows A$. In turn this relation can be expressed
as

$$\sum_{i,j} A_i \times_X A_j \rightrightarrows \sum_i A_i$$

(where each summand $A_i \times_X A_j$ is an intersection of parts and so is again a part of
$X$). By the universal mapping property of coequalizers, any part $B \hookrightarrow X$ to which
each of the given parts belongs will also contain the union; i.e. in the category of
parts of $X$

$$\bigcup_i A_i \subseteq B$$

$$A_i \subseteq B \text{ for all } i$$

are equivalent; this is the objective form of the rules of inference for disjunction
(= repeated “or”). However, the universal mapping property of the union is stronger than
that since even if $B$ is an arbitrary object (not necessarily the domain of a part of
$X$), any given family $A_i \xrightarrow{f_i} B$ of maps (not necessarily monos) will be derivable
from a unique single map $\bigcup_i A_i \to B$ provided only that the equations

$$\begin{array}{ccc} A_i \times_X A_j & \longrightarrow & A_j \\ \downarrow & & \downarrow{\scriptstyle f_j} \\ A_i & \xrightarrow{\ f_i\ } & B \end{array}$$

expressing agreement on the overlap are satisfied by the given family $f_i$.
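
For parts of a finite set, the stronger mapping property amounts to gluing: maps defined on the parts combine into one map on the union exactly when they agree wherever two parts overlap. The sketch below assumes each $f_i$ is a Python dict whose keys are the members of $A_i$; `glue` is a hypothetical name.

```python
def glue(family):
    """Combine maps f_i defined on parts A_i into the single map on their union.

    Returns None when two of the maps disagree on a common member, i.e. when
    the agreement equations on some overlap fail.
    """
    glued = {}
    for f in family:
        for x, value in f.items():
            if x in glued and glued[x] != value:
                return None
            glued[x] = value
    return glued

f1 = {1: "u", 2: "v"}          # defined on the part with members 1, 2
f2 = {2: "v", 3: "u"}          # defined on the part with members 2, 3; not a mono overall
f2_bad = {2: "w", 3: "u"}      # disagrees with f1 on the overlap member 2

on_union = glue([f1, f2])
clash = glue([f1, f2_bad])
```

Here `on_union` is the map `{1: "u", 2: "v", 3: "u"}` on the union, and `clash` is `None` because the overlap condition fails.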

Yoneda Embedding : 

A functor $Q : \mathcal{A} \to \mathcal{S}$ is called representable if there is an object $A$ in $\mathcal{A}$ and
an element $q$ of the set $Q(A)$ so that for all $B$ in $\mathcal{A}$, the map $\mathcal{A}(A, B) \to Q(B)$,
which takes any $A \xrightarrow{b} B$ into $Q(b)(q)$, is an isomorphism of sets. Equivalently,
there exists a natural isomorphism of functors $\mathcal{A}(A, -) \cong Q$, where $\mathcal{A}(A, -)$ is the
very special functor $\mathcal{A} \to \mathcal{S}$ whose values “are” (i.e., perfectly parameterize) the
maps with domain $A$ provided by the category $\mathcal{A}$. It is a consequence of Yoneda’s
lemma that any two representing objects for the same functor $Q$ are isomorphic, and
indeed there is a unique isomorphism $a$ between them such that the induced $Q(a)$
takes the “universal element” $q$ given with one representation into the universal
element given with the other. For any map $a$ (not necessarily an isomorphism) from
$A_1$ to $A_2$ in a category $\mathcal{A}$, the composition of maps induces a corresponding natural
transformation $a^* : \mathcal{A}(A_2, -) \to \mathcal{A}(A_1, -)$ (see Exercise 10.22) in the opposite
direction between the corresponding representable functors. Thus, there is a functor

$$\mathcal{A}^{op} \xrightarrow{\ Y\ } \mathcal{S}^{\mathcal{A}}$$

called the Yoneda embedding, defined by $Y(A) = \mathcal{A}(A, -)$.

The very same construction applied to the opposite category yields the notion of
contravariant representable functor $X : \mathcal{A}^{op} \to \mathcal{S}$ as one for which $X(B) \cong \mathcal{A}(B, A)$
naturally for all $B$ but for a fixed $A$ that is said to represent $X$. Because the resulting
Yoneda embedding

$$\mathcal{A} \xrightarrow{\ Y\ } \mathcal{S}^{\mathcal{A}^{op}}$$

is full and faithful (see Yoneda’s lemma), one often suppresses any symbol for it and
considers $\mathcal{A}$ to be simply a special subcategory of $\mathcal{S}^{\mathcal{A}^{op}}$; and conversely,
subcategories $\mathcal{C}$ of $\mathcal{S}^{\mathcal{A}^{op}}$ that contain this $\mathcal{A}$ often embody very useful generalizations of the
concept that $\mathcal{A}$ itself embodied. The special cases in which $\mathcal{A}$ is a monoid or a poset
were already studied and used over a century ago: Cayley observed that a canonical
example of a left action of $\mathcal{A}$ is the action of $\mathcal{A}$ itself on the right, and Schroder,
Dedekind, Cantor, and others made much use of the canonical representation of
a poset as inclusions between certain subsets (actually right ideals) of a given set
(namely of its own set of objects). Note that the category 2 with one nonidentity
arrow (see Section 6.2) embeds in $\mathcal{S}$ as the sets 0 and 1 and that the Yoneda
embedding of a given category $\mathcal{A}$ actually belongs to the induced subcategory



if and only if A is a poset. 

Yoneda’s Lemma : 

For any contravariant functor $X : \mathcal{A}^{op} \to \mathcal{S}$ to sets and for any given object $A$, the
value $X(A)$ is isomorphic as a set to the (a priori “big”) set of natural transformations
$\mathcal{A}(-, A) \to X$ (see Exercise 10.24). When we introduce some still larger
categories via exponentiation in the category of categories, wherein $\mathcal{S}$ is an object,
this says more fully that the functor “evaluate at $A$”
is one of the representable functors on $\mathcal{S}^{\mathcal{A}^{op}}$; indeed it is representable by the
representable functor on $\mathcal{A}^{op}$. Corollaries of Yoneda’s lemma include the fullness of the
Yoneda embedding, the uniqueness of representing objects, and the fact that “any”
set-valued functor is canonically a colimit of representable ones. In case $\mathcal{A}$ itself is
small relative to $\mathcal{S}$ (which means that the objects of $\mathcal{A}$ can be parameterized by an
object of $\mathcal{S}$ and that $\mathcal{A}$ has small hom-sets), then the latter is equivalent to the fact
that any $X$ has a presentation as a coequalizer of two maps between two infinite
sums of representable functors.
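
In the one-object case the lemma can be verified by enumeration. The sketch below makes several assumptions for the sake of a small example: the category is the two-element monoid $(\{0, 1\}, \cdot)$, a contravariant set-valued functor on it is a right action on a three-member set, and natural transformations from the representable functor are the equivariant maps from the monoid (acting on itself by multiplication) to that set. All names are hypothetical.

```python
from itertools import product

M = [0, 1]                       # the monoid ({0, 1}, *), with unit 1
unit = 1

def mult(m, n):
    return m * n

X = ["a", "b", "c"]
r = {"a": "a", "b": "a", "c": "c"}   # an idempotent map: how 0 acts on X

def act(x, m):
    """Right action of M on X: the unit acts trivially, 0 acts by r."""
    return x if m == unit else r[x]

def is_equivariant(phi):
    """phi(m n) = phi(m) . n for all m, n: naturality in the one-object case."""
    return all(phi[mult(m, n)] == act(phi[m], n) for m in M for n in M)

candidates = [dict(zip(M, values)) for values in product(X, repeat=len(M))]
natural = [phi for phi in candidates if is_equivariant(phi)]

# Yoneda: evaluating at the unit is a bijection from natural maps onto X.
values_at_unit = sorted(phi[unit] for phi in natural)
```

Of the nine candidate maps only three are equivariant, one for each member of `X`, and each is determined by its value at the unit.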


Yoneda and Totality : 

For any category $\mathcal{A}$ such that the arrows between any two objects of $\mathcal{A}$ can be
parameterized by an object of $\mathcal{S}$, we say that $\mathcal{A}$ has small hom-sets.

A category $\mathcal{A}$ with small hom-sets is totally cocomplete if its Yoneda embedding
$Y_* = Y$ has a left adjoint $Y^* : \mathcal{S}^{\mathcal{A}^{op}} \to \mathcal{A}$. For example, the category of groups or
the category $\mathcal{A} = \mathcal{S}^{\mathcal{C}^{op}}$ of $\mathcal{C}$-sets are totally cocomplete; $Y^*$ too may have a left
adjoint $Y_!$. For example, $\mathcal{C}$-sets, but not groups, have such an “opposite” embedding
$Y_!$ into the same functor category. One and only one category $\mathcal{A}$ has both a further
adjoint to $Y_!$ and also a still further left adjoint to that, i.e., an adjoint string of
length 5

$$U \dashv V \dashv Y_! \dashv Y^* \dashv Y_*$$

starting with $Y_* = Y$ on the right, relating $\mathcal{A}$ to $\mathcal{S}^{\mathcal{A}^{op}}$, namely $\mathcal{A} = \mathcal{S}$, the base
category of sets itself [RW94]. There are several subcategories of the functor category
$\mathcal{S}^{\mathcal{C}^{op}}$ that still unite the opposites $Y_!$ and $Y_*$; for example, the category of simplicial
complexes that are important in combinatorial topology. Another subcategory of
$\mathcal{S}^{\mathcal{C}^{op}}$ containing $Y_*$ as the codiscrete objects is the category of bornological sets,
important in functional analysis, where $X : \mathcal{S}^{op} \to \mathcal{S}$ is considered as a structure
in which $X(S) = \mathrm{Bor}(S, X)$ acts as the set of “bounded” $S$-parameterized parts
of $X$.

C.2 Mathematical Notations and Logical Symbols

$\to$                denoting a map (also called transformation or function)
                     from a domain $A$ to a codomain $B$: $A \to B$
$\Rightarrow$        $A \Rightarrow B$ ($A$ implies $B$) or ($A$ only if $B$) or (if $A$ then $B$)
$\Leftarrow$         $A \Leftarrow B$ ($A$ is implied by $B$) or ($A$ if $B$) or (if $B$ then $A$)
$\Leftrightarrow$    $A \Leftrightarrow B$, also written “iff” ($A$ if and only if $B$):
                     $A \Rightarrow B$ & $B \Rightarrow A$ ($A$ implies $B$ and $B$ implies $A$)
$\mapsto$            value at
$\xrightarrow{\sim}$ a map that has an inverse (called isomorphism)
$\hookrightarrow$    representing the domain as a part of the codomain
$=$                  equals
$\equiv$             equivalence of parts
$\forall$            for all
$\exists$            there exists (existential statement)
$\in$                membership (a member of)
$>$                  greater than
$\infty$             infinity
$\vdash$             entails
$\wedge$             and, also &
$\neq$               not equal
$\subseteq$          inclusion (of a part in another part)
$!$                  unique (= exactly one)
$\neg$               not
$\cap$               intersection (of parts)
$<$                  smaller than
$\sum$               sum
$\int$               integral sign
$\vee$               or
$fg$                 composite of two maps (read “$f$ following $g$”)
$\langle\ \rangle$   notation for ordered pairs or triples or $n$-tuples
$\setminus$          but not
$\partial$           boundary of

C.3 The Greek Alphabet 

Since the Latin letters a, b, x, y, and so forth, are sometimes not enough to express all 
the different mathematical entities that may arise in a given discussion, Greek letters 
are also used in all fields where mathematics is applied. Usually their meaning is 
specified at the beginning of each discussion just as is done for the meaning of the 
Latin letters. The Greek letters themselves are pronounced as follows: 


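$\alpha$      alpha        $\nu$         nu 
$\beta$       beta         $\xi$         xi 
$\gamma$      gamma        $o$           omicron 
$\delta$      delta        $\pi$         pi 
$\epsilon$    epsilon      $\rho$        rho 
$\zeta$       zeta         $\sigma$      sigma 
$\eta$        eta          $\tau$        tau 
$\theta$      theta        $\upsilon$    upsilon 
$\iota$       iota         $\phi$        phi 
$\kappa$      kappa        $\chi$        chi 
$\lambda$     lambda       $\psi$        psi 
$\mu$         mu           $\omega$      omega 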


This list contains the lowercase Greek letters. The uppercase sigma and pi are 
often used to denote sum and product and are written $\Sigma$ and $\Pi$. Other frequently 
occurring uppercase Greek letters are $\Omega$ (omega), $\Delta$ (delta), $\Gamma$ (gamma), $\Psi$ (psi), 
$\Phi$ (phi), and $X$ (chi). 

constant map, 168 
map, 65 
functor, 109 
Dirac delta, 126 
domain, 2 
dual 
  category, 25 
  proof, 67 
  property, 21 
duality, concrete, 120ff 
dynamical system, 155 

Eilenberg, v, 233 
generalized, 16, 235 
elementary topos, 111 
empty set, 12 
entails, 196 
  versus implies, 198 
equality, rule of inference for, 209 
equalizer, 66 
equinumerous, 57 
equivalence relation, 89 
equivalent parts, 40, 138 
equivalent categories, 46 
equivariant, 77 
excluded middle, 200 
existential quantifier, 201ff 
expectation, 149 
external diagram, 3 

f-chain, 224 
family, 5 
feedback, 186ff 
  control, 187 
fibered product, 69 
field, 213, 230 
finite state machines, category of, 24 
finite, Dedekind, 58 
fixed point, 131, 224 
for all, 36, 203 
foundation, v, 235 
  category of categories as, 235 
Fourier transform, 126 
Fraenkel, 130 
free monoid, 77 
Frege, 130 
function, 236 
functionals, 101 
functor, 109, 172, 236 
  representable, 248 
functoriality 
  of products, 98 
  of function spaces, 102ff 

Gaifman, 229 
Galileo, 57 
generalized 
  element, 16, 235 
  point, 150 
  continuum hypothesis, 246 
Giraud, 247 
Gödel, 130 
graph, 63, 212 
  chaotic, 180ff 
  of a mapping, 63, 211ff 
  reversible, 176 
Grassmann, 239 
Grothendieck, 242 
  topos, 247 
groupoid, 170 
groups, category of, 24 

Hausdorff, 175, 220, 243 
  Maximal Principle, 226 
Heyting, 240 
homomorphism, 172 

ideal, 230 
idempotent, 189, 217 
arrow, 10 
law, 11 
mapping, 4 
image, 90, 136 
  existential quantification and, 137, 139, 194, 203ff 
implication, 35, 198 
inclusion, 34, 237 
indexed families, 45 
indicator, 38 
inference rules, 35, 195ff 
infinite, Dedekind, 58 
inflationary, 224, 228ff 
initial object, 12 
injective mapping, 8, 238 
injective object, 124, 238 
integral domain, 216 
internal diagram, 2 
intersection, 43, 44, 237 
  of a family, 223 
inverse image, 41, 237 
inverse limit, 71ff 
involution, 168 
Isbell, 151 
isomorphic, 18 
isomorphism, 54 
  of categories, 185 
iteration, 160ff 

Kan, 233 

left 
  A action (set), 171 
  A-set, 171 
  adjoint, 118 
  adjoint functor, 231 
  cancellation, 238 
Leibniz rule, 201 
lift, 82 
limit, 58, 72 
Lincoln, 13 
linear transformation, 24 
listing, 5 
localic, 93 
logic, 193ff, 239 
  intuitionistic, 240 
  objective, 239 
  positive, 195 
degenerate, 180 
in reflexive graph, 180 
in reversible graph, 176f 
one-lane, 177 
lower bound, 221 
  greatest, 77, 221 

M-sets, category of, 77 
Mac Lane, v, 233 
mapping, 2, 240 
mapping set, 98 
matrix, 127 
maximal object, 221 
Maximal Principle 
  of Hausdorff, 226 
  of Zorn, 85, 220ff 
    chain version, 226 
    directed version, 222 
maximum object, 221 
measurable cardinals, 151 
membership, 34, 241 
metric space, 144 
monoid, 77, 167 
monomapping, 238 
monomorphism, 32, 238 
morphism, 10 
motion, 100 
multiplication, 162 

natural, 126 
  isomorphism, 232, 241 
  map, 172 
  number object, 156 
  transformation, 135, 173, 241 
negation, 200 
nilpotent, 217 
number theory, 154ff 

object, 10 
one element set, 241 
one-to-one correspondence, 54 
operators, 101 
opposite category, 25, 174 
order of nilpotency, 217 

part, 33, 242 
  bounded, 144 
of, 24 
Peano’s postulate, 163 
pointed sets, category of, 25, 46 
posets, 169 
  arise concretely, 169 
  strict, 224 
power set functor 
  contravariant, 122ff 
predicate, 194 
product, 61 
property, 5, 17 
pullback, 69 
pushout, 79 

reciprocal, 213 
recursion, 157ff 
reflexive, 88 
relation, 87 
representable functor, 248 
representation of membership, 39 
restriction, 40 
retraction, 48 
right 
  A action (set), 173 
  adjoint functor, 231 
  cancellation, 80, 235 
ring, 213 
Russell, 130 

Schröder, 239, 249 
section, 48 
self-mapping, 13 
separator, 11, 242 
set theory, 242 
  axiomatic, 243 
  naive, 243 
  parameterization, 243 
singleton map, 124, 142 
Skolem, 239 
slice category, 25 
small hom-sets, 250 
space, topological, 143 
span, 78 
sphere, 52 
split 
  image, 89ff 
  injection, 49 
  monomapping, 49 
  surjective, 82 
Steenrod, vi, 232 
Steiner, 242 
sub dynamical system, 164 
subposet, 220 
subring, 215 
substitution, rule of, 209 
subtraction, logical, 201 
sum, 28 
support, 112 
  splits, 112 
supremum, 221 
surjective mapping, 8, 244 
symmetric, 88 
  arrow, 149 

Tarski, 130, 228 
terminal object, 6 
  set axiom, 6 
theory, 154 
topological space, 143 
topology, algebraic, 232 
topos, 244 
  of abstract sets, 246 
  and Cantorian contrast, 245 
  elementary, 111 
  geometric morphism of, 246 
  Grothendieck, 247 
total order, 223 
totally cocomplete, 250 
transitive, 89 
truth values, 39 
  represent membership, 39, 111 
  represent parts, 111 
two, 27 

union, 247 
unique existential quantifier, 211 
universal 
  mapping property, 26, 29 
  quantifier, 203 
upper bound, 221 
  least, 221 

variable 
  element, 16 
  sets, 17, 154 
  two-stage, 114ff 
vector spaces, category of, 24 
von Neumann, 130 

Walters, 134 
weakly averaging, 149 
well ordered, 227 
Well-Ordering Theorem of Zermelo, 227 
well-pointed topos, 113 

Yoneda, 175 
  and totality, 250 
  embedding, 249 
Yoneda’s Lemma, 249 

Zermelo, 130, 220 
zero object, 45 
Zorn, vi, 220ff 



